Overview
Brought to you by YData
Dataset statistics
| Number of variables | 112 |
|---|---|
| Number of observations | 186529 |
| Missing cells | 6512992 |
| Missing cells (%) | 31.2% |
| Total size in memory | 159.4 MiB |
| Average record size in memory | 896.0 B |
Variable types
| Text | 112 |
|---|
Dataset
| Description | Botany Division, Yale Peabody Museum 0061682-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.twf535 |
accessRights has constant value "Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj" | Constant |
language has constant value "en" | Constant |
license has constant value "CC0_1_0" | Constant |
publisher has constant value "Yale University Peabody Museum" | Constant |
rightsHolder has constant value "Yale Peabody Museum" | Constant |
type has constant value "PhysicalObject" | Constant |
institutionCode has constant value "YPM" | Constant |
ownerInstitutionCode has constant value "YPM" | Constant |
basisOfRecord has constant value "PRESERVED_SPECIMEN" | Constant |
individualCount has constant value "1" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
preparations has constant value "tissue (frozen)" | Constant |
disposition has constant value "in collection" | Constant |
nomenclaturalCode has constant value "ICBN" | Constant |
taxonRemarks has constant value "Animals and Plants: Plants" | Constant |
datasetKey has constant value "963f12d0-f762-11e1-a439-00145eb45e9a" | Constant |
publishingCountry has constant value "US" | Constant |
protocol has constant value "EML" | Constant |
lastCrawled has constant value "2025-01-07T13:01:58.967Z" | Constant |
isSequenced has constant value "false" | Constant |
publishedByGbifRegion has constant value "NORTH_AMERICA" | Constant |
recordNumber has 139017 (74.5%) missing values | Missing |
recordedBy has 75764 (40.6%) missing values | Missing |
reproductiveCondition has 186504 (> 99.9%) missing values | Missing |
preparations has 186476 (> 99.9%) missing values | Missing |
associatedReferences has 176462 (94.6%) missing values | Missing |
associatedTaxa has 185782 (99.6%) missing values | Missing |
eventDate has 84019 (45.0%) missing values | Missing |
startDayOfYear has 103374 (55.4%) missing values | Missing |
endDayOfYear has 103374 (55.4%) missing values | Missing |
year has 84248 (45.2%) missing values | Missing |
month has 93636 (50.2%) missing values | Missing |
day has 104750 (56.2%) missing values | Missing |
habitat has 157729 (84.6%) missing values | Missing |
higherGeography has 72099 (38.7%) missing values | Missing |
continent has 73143 (39.2%) missing values | Missing |
waterBody has 183495 (98.4%) missing values | Missing |
countryCode has 72482 (38.9%) missing values | Missing |
stateProvince has 78016 (41.8%) missing values | Missing |
county has 98586 (52.9%) missing values | Missing |
municipality has 110052 (59.0%) missing values | Missing |
locality has 125307 (67.2%) missing values | Missing |
verbatimElevation has 178933 (95.9%) missing values | Missing |
decimalLatitude has 82100 (44.0%) missing values | Missing |
decimalLongitude has 82100 (44.0%) missing values | Missing |
coordinateUncertaintyInMeters has 82138 (44.0%) missing values | Missing |
georeferencedBy has 182211 (97.7%) missing values | Missing |
georeferencedDate has 174887 (93.8%) missing values | Missing |
georeferenceProtocol has 82331 (44.1%) missing values | Missing |
georeferenceSources has 83888 (45.0%) missing values | Missing |
georeferenceRemarks has 85474 (45.8%) missing values | Missing |
typeStatus has 182608 (97.9%) missing values | Missing |
identifiedBy has 180415 (96.7%) missing values | Missing |
dateIdentified has 184582 (99.0%) missing values | Missing |
identificationRemarks has 182833 (98.0%) missing values | Missing |
phylum has 28431 (15.2%) missing values | Missing |
class has 28457 (15.3%) missing values | Missing |
order has 28496 (15.3%) missing values | Missing |
family has 28710 (15.4%) missing values | Missing |
genus has 28788 (15.4%) missing values | Missing |
genericName has 28825 (15.5%) missing values | Missing |
specificEpithet has 54371 (29.1%) missing values | Missing |
infraspecificEpithet has 182164 (97.7%) missing values | Missing |
elevation has 178933 (95.9%) missing values | Missing |
elevationAccuracy has 185793 (99.6%) missing values | Missing |
distanceFromCentroidInMeters has 186092 (99.8%) missing values | Missing |
mediaType has 9347 (5.0%) missing values | Missing |
phylumKey has 28431 (15.2%) missing values | Missing |
classKey has 28457 (15.3%) missing values | Missing |
orderKey has 28496 (15.3%) missing values | Missing |
familyKey has 28710 (15.4%) missing values | Missing |
genusKey has 28788 (15.4%) missing values | Missing |
speciesKey has 54335 (29.1%) missing values | Missing |
species has 54335 (29.1%) missing values | Missing |
repatriated has 72482 (38.9%) missing values | Missing |
gbifRegion has 72484 (38.9%) missing values | Missing |
level0Gid has 86228 (46.2%) missing values | Missing |
level0Name has 86228 (46.2%) missing values | Missing |
level1Gid has 86228 (46.2%) missing values | Missing |
level1Name has 86228 (46.2%) missing values | Missing |
level2Gid has 87766 (47.1%) missing values | Missing |
level2Name has 87766 (47.1%) missing values | Missing |
level3Gid has 178900 (95.9%) missing values | Missing |
level3Name has 178901 (95.9%) missing values | Missing |
iucnRedListCategory has 10881 (5.8%) missing values | Missing |
gbifID has unique values | Unique |
bibliographicCitation has unique values | Unique |
references has unique values | Unique |
dynamicProperties has unique values | Unique |
occurrenceID has unique values | Unique |
catalogNumber has unique values | Unique |
Reproduction
| Analysis started | 2025-01-08 23:32:03.397922 |
|---|---|
| Analysis finished | 2025-01-08 23:32:14.028777 |
| Duration | 10.63 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 186529 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 186529 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1038985783 |
|---|---|
| 2nd row | 1038985820 |
| 3rd row | 1038985793 |
| 4th row | 1805296727 |
| 5th row | 4539832816 |
| Value | Count | Frequency (%) |
| 1038985783 | 1 | < 0.1% |
| 1038985974 | 1 | < 0.1% |
| 1038985864 | 1 | < 0.1% |
| 1805437104 | 1 | < 0.1% |
| 1038985828 | 1 | < 0.1% |
| 1038985793 | 1 | < 0.1% |
| 1805296727 | 1 | < 0.1% |
| 4539832816 | 1 | < 0.1% |
| 1038985782 | 1 | < 0.1% |
| 1038985792 | 1 | < 0.1% |
| Other values (186519) | 186519 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 257934 | |
| 8 | 257321 | |
| 3 | 244061 | |
| 1 | 238732 | |
| 9 | 237662 | |
| 4 | 158719 | |
| 5 | 155085 | |
| 2 | 126849 | |
| 6 | 103154 | 5.5% |
| 7 | 85773 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1865290 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 257934 | |
| 8 | 257321 | |
| 3 | 244061 | |
| 1 | 238732 | |
| 9 | 237662 | |
| 4 | 158719 | |
| 5 | 155085 | |
| 2 | 126849 | |
| 6 | 103154 | 5.5% |
| 7 | 85773 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1865290 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 257934 | |
| 8 | 257321 | |
| 3 | 244061 | |
| 1 | 238732 | |
| 9 | 237662 | |
| 4 | 158719 | |
| 5 | 155085 | |
| 2 | 126849 | |
| 6 | 103154 | 5.5% |
| 7 | 85773 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1865290 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 257934 | |
| 8 | 257321 | |
| 3 | 244061 | |
| 1 | 238732 | |
| 9 | 237662 | |
| 4 | 158719 | |
| 5 | 155085 | |
| 2 | 126849 | |
| 6 | 103154 | 5.5% |
| 7 | 85773 | 4.6% |
accessRights
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 129 |
|---|---|
| Median length | 129 |
| Mean length | 129 |
| Min length | 129 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
|---|---|
| 2nd row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
| 3rd row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
| 4th row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
| 5th row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
| Value | Count | Frequency (%) |
| open | 186529 | |
| access | 186529 | |
| http://creativecommons.org/publicdomain/zero/1.0 | 186529 | |
| see | 186529 | |
| yale | 186529 | |
| peabody | 186529 | |
| policies | 186529 | |
| at | 186529 | |
| http://hdl.handle.net/10079/8931zqj | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2238348 | 9.3% |
| / | 1865290 | 7.8% |
| 1492232 | 6.2% | |
| t | 1305703 | 5.4% |
| o | 1305703 | 5.4% |
| a | 1119174 | 4.7% |
| c | 1119174 | 4.7% |
| i | 932645 | 3.9% |
| n | 932645 | 3.9% |
| s | 932645 | 3.9% |
| Other values (28) | 10818682 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16228023 | |
| Other Punctuation | 3544051 | 14.7% |
| Decimal Number | 2051819 | 8.5% |
| Space Separator | 1492232 | 6.2% |
| Uppercase Letter | 746116 | 3.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2238348 | |
| t | 1305703 | 8.0% |
| o | 1305703 | 8.0% |
| a | 1119174 | 6.9% |
| c | 1119174 | 6.9% |
| i | 932645 | 5.7% |
| n | 932645 | 5.7% |
| s | 932645 | 5.7% |
| l | 932645 | 5.7% |
| p | 932645 | 5.7% |
| Other values (12) | 4476696 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 559587 | |
| 0 | 559587 | |
| 9 | 373058 | |
| 8 | 186529 | 9.1% |
| 7 | 186529 | 9.1% |
| 3 | 186529 | 9.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1865290 | |
| . | 746116 | 21.1% |
| : | 559587 | 15.8% |
| ; | 186529 | 5.3% |
| , | 186529 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 186529 | |
| O | 186529 | |
| Y | 186529 | |
| A | 186529 |
Space Separator
| Value | Count | Frequency (%) |
| 1492232 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16974139 | |
| Common | 7088102 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2238348 | |
| t | 1305703 | 7.7% |
| o | 1305703 | 7.7% |
| a | 1119174 | 6.6% |
| c | 1119174 | 6.6% |
| i | 932645 | 5.5% |
| n | 932645 | 5.5% |
| s | 932645 | 5.5% |
| l | 932645 | 5.5% |
| p | 932645 | 5.5% |
| Other values (16) | 5222812 |
Common
| Value | Count | Frequency (%) |
| / | 1865290 | |
| 1492232 | ||
| . | 746116 | 10.5% |
| : | 559587 | 7.9% |
| 1 | 559587 | 7.9% |
| 0 | 559587 | 7.9% |
| 9 | 373058 | 5.3% |
| 8 | 186529 | 2.6% |
| 7 | 186529 | 2.6% |
| 3 | 186529 | 2.6% |
| Other values (2) | 373058 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24062241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2238348 | 9.3% |
| / | 1865290 | 7.8% |
| 1492232 | 6.2% | |
| t | 1305703 | 5.4% |
| o | 1305703 | 5.4% |
| a | 1119174 | 4.7% |
| c | 1119174 | 4.7% |
| i | 932645 | 3.9% |
| n | 932645 | 3.9% |
| s | 932645 | 3.9% |
| Other values (28) | 10818682 |
Unique 
| Distinct | 186529 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 55 |
| Mean length | 28.163299 |
| Min length | 15 |
Unique
| Unique | 186529 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Luzula bulbosa (YU.036650) |
|---|---|
| 2nd row | Gentiana clausa (CBS.028950) |
| 3rd row | Carex muhlenbergii (YU.070008) |
| 4th row | Lophocolea minor (YU.204399) |
| 5th row | Plantae (YU.175465) |
| Value | Count | Frequency (%) |
| plantae | 28374 | 5.5% |
| carex | 8803 | 1.7% |
| var | 3699 | 0.7% |
| dryopteris | 2392 | 0.5% |
| sphagnum | 2360 | 0.5% |
| juncus | 1814 | 0.4% |
| frullania | 1708 | 0.3% |
| asplenium | 1557 | 0.3% |
| scapania | 1517 | 0.3% |
| canadensis | 1515 | 0.3% |
| Other values (197634) | 462834 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 389969 | 7.4% |
| 330044 | 6.3% | |
| i | 262458 | 5.0% |
| 0 | 223252 | 4.2% |
| e | 205819 | 3.9% |
| l | 196972 | 3.7% |
| . | 190701 | 3.6% |
| ( | 186530 | 3.6% |
| ) | 186530 | 3.6% |
| r | 175234 | 3.3% |
| Other values (58) | 2905763 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2641527 | |
| Decimal Number | 1119258 | |
| Uppercase Letter | 597975 | 11.4% |
| Space Separator | 330044 | 6.3% |
| Other Punctuation | 190703 | 3.6% |
| Open Punctuation | 186530 | 3.6% |
| Close Punctuation | 186530 | 3.6% |
| Dash Punctuation | 705 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 389969 | |
| i | 262458 | |
| e | 205819 | 7.8% |
| l | 196972 | 7.5% |
| r | 175234 | 6.6% |
| n | 170772 | 6.5% |
| u | 162576 | 6.2% |
| o | 156926 | 5.9% |
| s | 154857 | 5.9% |
| t | 146008 | 5.5% |
| Other values (16) | 619936 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 149178 | |
| Y | 148189 | |
| C | 64443 | |
| S | 55381 | 9.3% |
| P | 49351 | 8.3% |
| B | 44799 | 7.5% |
| A | 13951 | 2.3% |
| L | 10862 | 1.8% |
| D | 7781 | 1.3% |
| R | 6989 | 1.2% |
| Other values (16) | 47051 | 7.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 223252 | |
| 2 | 150900 | |
| 1 | 131446 | |
| 3 | 102608 | |
| 4 | 92196 | |
| 5 | 86628 | 7.7% |
| 8 | 85549 | 7.6% |
| 7 | 83994 | 7.5% |
| 6 | 83823 | 7.5% |
| 9 | 78862 | 7.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 190701 | |
| ? | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 330044 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 186530 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 186530 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 705 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3239502 | |
| Common | 2013770 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 389969 | 12.0% |
| i | 262458 | 8.1% |
| e | 205819 | 6.4% |
| l | 196972 | 6.1% |
| r | 175234 | 5.4% |
| n | 170772 | 5.3% |
| u | 162576 | 5.0% |
| o | 156926 | 4.8% |
| s | 154857 | 4.8% |
| U | 149178 | 4.6% |
| Other values (42) | 1214741 |
Common
| Value | Count | Frequency (%) |
| 330044 | ||
| 0 | 223252 | |
| . | 190701 | |
| ( | 186530 | |
| ) | 186530 | |
| 2 | 150900 | |
| 1 | 131446 | 6.5% |
| 3 | 102608 | 5.1% |
| 4 | 92196 | 4.6% |
| 5 | 86628 | 4.3% |
| Other values (6) | 332935 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5253272 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 389969 | 7.4% |
| 330044 | 6.3% | |
| i | 262458 | 5.0% |
| 0 | 223252 | 4.2% |
| e | 205819 | 3.9% |
| l | 196972 | 3.7% |
| . | 190701 | 3.6% |
| ( | 186530 | 3.6% |
| ) | 186530 | 3.6% |
| r | 175234 | 3.3% |
| Other values (58) | 2905763 |
language
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
| Value | Count | Frequency (%) |
| en | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 186529 | |
| n | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 373058 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 186529 | |
| n | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 373058 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 186529 | |
| n | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 373058 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 186529 | |
| n | 186529 |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 373058 | |
| 0 | 373058 | |
| _ | 373058 | |
| 1 | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 559587 | |
| Uppercase Letter | 373058 | |
| Connector Punctuation | 373058 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 373058 | |
| 1 | 186529 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 373058 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 373058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 932645 | |
| Latin | 373058 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 373058 | |
| _ | 373058 | |
| 1 | 186529 |
Latin
| Value | Count | Frequency (%) |
| C | 373058 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1305703 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 373058 | |
| 0 | 373058 | |
| _ | 373058 | |
| 1 | 186529 |
modified
Text
| Distinct | 7024 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 1517 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 2023-03-01T19:35:25Z |
|---|---|
| 2nd row | 2020-10-02T23:17:12Z |
| 3rd row | 2020-12-23T21:50:47Z |
| 4th row | 2020-06-26T23:18:45Z |
| 5th row | 2024-03-19T11:52:47Z |
| Value | Count | Frequency (%) |
| 2015-11-29t17:24:32z | 16880 | 9.0% |
| 2020-12-23t21:50:47z | 9978 | 5.3% |
| 2020-08-11t23:38:35z | 9456 | 5.1% |
| 2020-10-02t23:17:12z | 6413 | 3.4% |
| 2022-03-19t21:48:41z | 5153 | 2.8% |
| 2015-11-29t17:24:36z | 5077 | 2.7% |
| 2019-12-07t23:19:07z | 4868 | 2.6% |
| 2015-11-28t13:37:37z | 3604 | 1.9% |
| 2015-11-28t13:37:48z | 3531 | 1.9% |
| 2024-03-20t22:00:25z | 3149 | 1.7% |
| Other values (7014) | 118420 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 718500 | |
| 0 | 500102 | |
| 1 | 466016 | |
| - | 373058 | |
| : | 373058 | |
| 3 | 227531 | 6.1% |
| 4 | 206600 | 5.5% |
| T | 186529 | 5.0% |
| Z | 186529 | 5.0% |
| 5 | 159192 | 4.3% |
| Other values (4) | 333465 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2611406 | |
| Dash Punctuation | 373058 | 10.0% |
| Other Punctuation | 373058 | 10.0% |
| Uppercase Letter | 373058 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 718500 | |
| 0 | 500102 | |
| 1 | 466016 | |
| 3 | 227531 | 8.7% |
| 4 | 206600 | 7.9% |
| 5 | 159192 | 6.1% |
| 7 | 107197 | 4.1% |
| 8 | 80992 | 3.1% |
| 9 | 79600 | 3.0% |
| 6 | 65676 | 2.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 186529 | |
| Z | 186529 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 373058 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 373058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3357522 | |
| Latin | 373058 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 718500 | |
| 0 | 500102 | |
| 1 | 466016 | |
| - | 373058 | |
| : | 373058 | |
| 3 | 227531 | 6.8% |
| 4 | 206600 | 6.2% |
| 5 | 159192 | 4.7% |
| 7 | 107197 | 3.2% |
| 8 | 80992 | 2.4% |
| Other values (2) | 145276 | 4.3% |
Latin
| Value | Count | Frequency (%) |
| T | 186529 | |
| Z | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3730580 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 718500 | |
| 0 | 500102 | |
| 1 | 466016 | |
| - | 373058 | |
| : | 373058 | |
| 3 | 227531 | 6.1% |
| 4 | 206600 | 5.5% |
| T | 186529 | 5.0% |
| Z | 186529 | 5.0% |
| 5 | 159192 | 4.3% |
| Other values (4) | 333465 |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 30 |
| Mean length | 30 |
| Min length | 30 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yale University Peabody Museum |
|---|---|
| 2nd row | Yale University Peabody Museum |
| 3rd row | Yale University Peabody Museum |
| 4th row | Yale University Peabody Museum |
| 5th row | Yale University Peabody Museum |
| Value | Count | Frequency (%) |
| yale | 186529 | |
| university | 186529 | |
| peabody | 186529 | |
| museum | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 746116 | 13.3% |
| 559587 | 10.0% | |
| s | 373058 | 6.7% |
| y | 373058 | 6.7% |
| u | 373058 | 6.7% |
| i | 373058 | 6.7% |
| a | 373058 | 6.7% |
| M | 186529 | 3.3% |
| d | 186529 | 3.3% |
| o | 186529 | 3.3% |
| Other values (10) | 1865290 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4290167 | |
| Uppercase Letter | 746116 | 13.3% |
| Space Separator | 559587 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 746116 | |
| s | 373058 | 8.7% |
| y | 373058 | 8.7% |
| u | 373058 | 8.7% |
| i | 373058 | 8.7% |
| a | 373058 | 8.7% |
| d | 186529 | 4.3% |
| o | 186529 | 4.3% |
| b | 186529 | 4.3% |
| t | 186529 | 4.3% |
| Other values (5) | 932645 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 186529 | |
| P | 186529 | |
| Y | 186529 | |
| U | 186529 |
Space Separator
| Value | Count | Frequency (%) |
| 559587 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5036283 | |
| Common | 559587 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 746116 | |
| s | 373058 | 7.4% |
| y | 373058 | 7.4% |
| u | 373058 | 7.4% |
| i | 373058 | 7.4% |
| a | 373058 | 7.4% |
| M | 186529 | 3.7% |
| d | 186529 | 3.7% |
| o | 186529 | 3.7% |
| b | 186529 | 3.7% |
| Other values (9) | 1678761 |
Common
| Value | Count | Frequency (%) |
| 559587 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5595870 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 746116 | 13.3% |
| 559587 | 10.0% | |
| s | 373058 | 6.7% |
| y | 373058 | 6.7% |
| u | 373058 | 6.7% |
| i | 373058 | 6.7% |
| a | 373058 | 6.7% |
| M | 186529 | 3.3% |
| d | 186529 | 3.3% |
| o | 186529 | 3.3% |
| Other values (10) | 1865290 |
references
Text
Unique 
| Distinct | 186529 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 59 |
| Mean length | 59.20648264 |
| Min length | 59 |
Unique
| Unique | 186529 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://collections.peabody.yale.edu/search/Record/YU.036650 |
|---|---|
| 2nd row | http://collections.peabody.yale.edu/search/Record/CBS.028950 |
| 3rd row | http://collections.peabody.yale.edu/search/Record/YU.070008 |
| 4th row | http://collections.peabody.yale.edu/search/Record/YU.204399 |
| 5th row | http://collections.peabody.yale.edu/search/Record/YU.175465 |
| Value | Count | Frequency (%) |
| http://collections.peabody.yale.edu/search/record/yu.036650 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/yu.065082 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/yu.065678 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/yu.234842 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/yu.012442 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/yu.070008 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/yu.204399 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/yu.175465 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/yu.060443 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/yu.038995 | 1 | < 0.1% |
| Other values (186519) | 186519 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1119174 | 10.1% |
| / | 932645 | 8.4% |
| . | 746144 | 6.8% |
| c | 746116 | 6.8% |
| o | 746116 | 6.8% |
| l | 559587 | 5.1% |
| a | 559587 | 5.1% |
| t | 559587 | 5.1% |
| d | 559587 | 5.1% |
| h | 373058 | 3.4% |
| Other values (25) | 4142125 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7461160 | |
| Other Punctuation | 1865318 | 16.9% |
| Decimal Number | 1119258 | 10.1% |
| Uppercase Letter | 597990 | 5.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1119174 | |
| c | 746116 | |
| o | 746116 | |
| l | 559587 | 7.5% |
| a | 559587 | 7.5% |
| t | 559587 | 7.5% |
| d | 559587 | 7.5% |
| h | 373058 | 5.0% |
| y | 373058 | 5.0% |
| p | 373058 | 5.0% |
| Other values (6) | 1492232 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 223252 | |
| 2 | 150900 | |
| 1 | 131446 | |
| 3 | 102608 | |
| 4 | 92196 | |
| 5 | 86628 | 7.7% |
| 8 | 85549 | 7.6% |
| 7 | 83994 | 7.5% |
| 6 | 83823 | 7.5% |
| 9 | 78862 | 7.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 186529 | |
| Y | 148126 | |
| U | 148126 | |
| C | 38403 | 6.4% |
| B | 38403 | 6.4% |
| S | 38403 | 6.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 932645 | |
| . | 746144 | |
| : | 186529 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8059150 | |
| Common | 2984576 | 27.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1119174 | |
| c | 746116 | 9.3% |
| o | 746116 | 9.3% |
| l | 559587 | 6.9% |
| a | 559587 | 6.9% |
| t | 559587 | 6.9% |
| d | 559587 | 6.9% |
| h | 373058 | 4.6% |
| y | 373058 | 4.6% |
| p | 373058 | 4.6% |
| Other values (12) | 2090222 |
Common
| Value | Count | Frequency (%) |
| / | 932645 | |
| . | 746144 | |
| 0 | 223252 | 7.5% |
| : | 186529 | 6.2% |
| 2 | 150900 | 5.1% |
| 1 | 131446 | 4.4% |
| 3 | 102608 | 3.4% |
| 4 | 92196 | 3.1% |
| 5 | 86628 | 2.9% |
| 8 | 85549 | 2.9% |
| Other values (3) | 246679 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11043726 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1119174 | 10.1% |
| / | 932645 | 8.4% |
| . | 746144 | 6.8% |
| c | 746116 | 6.8% |
| o | 746116 | 6.8% |
| l | 559587 | 5.1% |
| a | 559587 | 5.1% |
| t | 559587 | 5.1% |
| d | 559587 | 5.1% |
| h | 373058 | 3.4% |
| Other values (25) | 4142125 |
rightsHolder
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yale Peabody Museum |
|---|---|
| 2nd row | Yale Peabody Museum |
| 3rd row | Yale Peabody Museum |
| 4th row | Yale Peabody Museum |
| 5th row | Yale Peabody Museum |
| Value | Count | Frequency (%) |
| yale | 186529 | |
| peabody | 186529 | |
| museum | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 559587 | |
| a | 373058 | |
| 373058 | ||
| u | 373058 | |
| Y | 186529 | 5.3% |
| l | 186529 | 5.3% |
| P | 186529 | 5.3% |
| b | 186529 | 5.3% |
| o | 186529 | 5.3% |
| d | 186529 | 5.3% |
| Other values (4) | 746116 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2611406 | |
| Uppercase Letter | 559587 | 15.8% |
| Space Separator | 373058 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 559587 | |
| a | 373058 | |
| u | 373058 | |
| l | 186529 | 7.1% |
| b | 186529 | 7.1% |
| o | 186529 | 7.1% |
| d | 186529 | 7.1% |
| y | 186529 | 7.1% |
| s | 186529 | 7.1% |
| m | 186529 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 186529 | |
| P | 186529 | |
| M | 186529 |
Space Separator
| Value | Count | Frequency (%) |
| 373058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3170993 | |
| Common | 373058 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 559587 | |
| a | 373058 | |
| u | 373058 | |
| Y | 186529 | 5.9% |
| l | 186529 | 5.9% |
| P | 186529 | 5.9% |
| b | 186529 | 5.9% |
| o | 186529 | 5.9% |
| d | 186529 | 5.9% |
| y | 186529 | 5.9% |
| Other values (3) | 559587 |
Common
| Value | Count | Frequency (%) |
| 373058 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3544051 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 559587 | |
| a | 373058 | |
| 373058 | ||
| u | 373058 | |
| Y | 186529 | 5.3% |
| l | 186529 | 5.3% |
| P | 186529 | 5.3% |
| b | 186529 | 5.3% |
| o | 186529 | 5.3% |
| d | 186529 | 5.3% |
| Other values (4) | 746116 |
type
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PhysicalObject |
|---|---|
| 2nd row | PhysicalObject |
| 3rd row | PhysicalObject |
| 4th row | PhysicalObject |
| 5th row | PhysicalObject |
| Value | Count | Frequency (%) |
| physicalobject | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 373058 | |
| P | 186529 | 7.1% |
| h | 186529 | 7.1% |
| y | 186529 | 7.1% |
| s | 186529 | 7.1% |
| i | 186529 | 7.1% |
| a | 186529 | 7.1% |
| l | 186529 | 7.1% |
| O | 186529 | 7.1% |
| b | 186529 | 7.1% |
| Other values (3) | 559587 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2238348 | |
| Uppercase Letter | 373058 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 373058 | |
| h | 186529 | |
| y | 186529 | |
| s | 186529 | |
| i | 186529 | |
| a | 186529 | |
| l | 186529 | |
| b | 186529 | |
| j | 186529 | |
| e | 186529 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 186529 | |
| O | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2611406 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 373058 | |
| P | 186529 | 7.1% |
| h | 186529 | 7.1% |
| y | 186529 | 7.1% |
| s | 186529 | 7.1% |
| i | 186529 | 7.1% |
| a | 186529 | 7.1% |
| l | 186529 | 7.1% |
| O | 186529 | 7.1% |
| b | 186529 | 7.1% |
| Other values (3) | 559587 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2611406 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 373058 | |
| P | 186529 | 7.1% |
| h | 186529 | 7.1% |
| y | 186529 | 7.1% |
| s | 186529 | 7.1% |
| i | 186529 | 7.1% |
| a | 186529 | 7.1% |
| l | 186529 | 7.1% |
| O | 186529 | 7.1% |
| b | 186529 | 7.1% |
| Other values (3) | 559587 |
datasetID
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 177440 | |
| 0 | 9089 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 177440 | |
| 0 | 9089 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 186529 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 177440 | |
| 0 | 9089 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 186529 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 177440 | |
| 0 | 9089 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 186529 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 177440 | |
| 0 | 9089 | 4.9% |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | YPM |
|---|---|
| 2nd row | YPM |
| 3rd row | YPM |
| 4th row | YPM |
| 5th row | YPM |
| Value | Count | Frequency (%) |
| ypm | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 186529 | |
| P | 186529 | |
| M | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 559587 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 186529 | |
| P | 186529 | |
| M | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 559587 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 186529 | |
| P | 186529 | |
| M | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 559587 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 186529 | |
| P | 186529 | |
| M | 186529 |
collectionCode
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.205882195 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | YU |
|---|---|
| 2nd row | CBS |
| 3rd row | YU |
| 4th row | YU |
| 5th row | YU |
| Value | Count | Frequency (%) |
| yu | 148126 | |
| cbs | 38403 | 20.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 148126 | |
| U | 148126 | |
| C | 38403 | 9.3% |
| B | 38403 | 9.3% |
| S | 38403 | 9.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 411461 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 148126 | |
| U | 148126 | |
| C | 38403 | 9.3% |
| B | 38403 | 9.3% |
| S | 38403 | 9.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 411461 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 148126 | |
| U | 148126 | |
| C | 38403 | 9.3% |
| B | 38403 | 9.3% |
| S | 38403 | 9.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 411461 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 148126 | |
| U | 148126 | |
| C | 38403 | 9.3% |
| B | 38403 | 9.3% |
| S | 38403 | 9.3% |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | YPM |
|---|---|
| 2nd row | YPM |
| 3rd row | YPM |
| 4th row | YPM |
| 5th row | YPM |
| Value | Count | Frequency (%) |
| ypm | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 186529 | |
| P | 186529 | |
| M | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 559587 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 186529 | |
| P | 186529 | |
| M | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 559587 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 186529 | |
| P | 186529 | |
| M | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 559587 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 186529 | |
| P | 186529 | |
| M | 186529 |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 932645 | |
| P | 373058 | 11.1% |
| R | 373058 | 11.1% |
| S | 373058 | 11.1% |
| V | 186529 | 5.6% |
| D | 186529 | 5.6% |
| _ | 186529 | 5.6% |
| C | 186529 | 5.6% |
| I | 186529 | 5.6% |
| M | 186529 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3170993 | |
| Connector Punctuation | 186529 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 932645 | |
| P | 373058 | 11.8% |
| R | 373058 | 11.8% |
| S | 373058 | 11.8% |
| V | 186529 | 5.9% |
| D | 186529 | 5.9% |
| C | 186529 | 5.9% |
| I | 186529 | 5.9% |
| M | 186529 | 5.9% |
| N | 186529 | 5.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3170993 | |
| Common | 186529 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 932645 | |
| P | 373058 | 11.8% |
| R | 373058 | 11.8% |
| S | 373058 | 11.8% |
| V | 186529 | 5.9% |
| D | 186529 | 5.9% |
| C | 186529 | 5.9% |
| I | 186529 | 5.9% |
| M | 186529 | 5.9% |
| N | 186529 | 5.9% |
Common
| Value | Count | Frequency (%) |
| _ | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3357522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 932645 | |
| P | 373058 | 11.1% |
| R | 373058 | 11.1% |
| S | 373058 | 11.1% |
| V | 186529 | 5.6% |
| D | 186529 | 5.6% |
| _ | 186529 | 5.6% |
| C | 186529 | 5.6% |
| I | 186529 | 5.6% |
| M | 186529 | 5.6% |
Unique 
| Distinct | 186529 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 468 |
|---|---|
| Median length | 364 |
| Mean length | 129.8176048 |
| Min length | 20 |
Unique
| Unique | 186529 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | { "irn": "1284160", "media": "1049200:23e7a3e4-d0b0-4e83-9ff2-192065f61a5a", "mm_repository_id": "1049200" } |
|---|---|
| 2nd row | { "irn": "1377942", "media": "109412:2de8b571-4db4-4d56-aca6-faa3477edb7c", "mm_repository_id": "109412", "solr_long_lat": "-72.2664,41.4854" } |
| 3rd row | { "irn": "908073", "solr_long_lat": "-72.9316,41.4070" } |
| 4th row | { "irn": "1892063", "media": "268631:3adf8b86-2732-45cd-aef6-c1ead71bd726", "mm_repository_id": "268631", "solr_long_lat": "-119,51" } |
| 5th row | { "irn": "2463858", "media": "1186778:f2d4000d-7289-44d9-bba3-f87582cd4f33 1186779:5b8ba8d4-ba11-4789-b865-bf0d163e1e42", "mm_repository_id": "1186778" } |
| Value | Count | Frequency (%) |
| 373805 | ||
| irn | 186529 | 11.0% |
| mm_repository_id | 177182 | 10.5% |
| media | 177182 | 10.5% |
| solr_long_lat | 104429 | 6.2% |
| 72.9316,41.4070 | 1988 | 0.1% |
| 72.920823,41.305111 | 1951 | 0.1% |
| 72.9247,41.3114 | 1870 | 0.1% |
| 72.88,41.6050 | 1661 | 0.1% |
| 73.036,41.5583 | 1211 | 0.1% |
| Other values (569062) | 662556 |
Most occurring characters
| Value | Count | Frequency (%) |
| " | 2587264 | 10.7% |
| 1503835 | 6.2% | |
| 1 | 1306746 | 5.4% |
| 4 | 1080313 | 4.5% |
| 2 | 1005309 | 4.2% |
| - | 909018 | 3.8% |
| 9 | 877072 | 3.6% |
| 8 | 856889 | 3.5% |
| 3 | 854143 | 3.5% |
| 7 | 850393 | 3.5% |
| Other values (34) | 12383766 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9188047 | |
| Lowercase Letter | 7460825 | |
| Other Punctuation | 4207683 | |
| Space Separator | 1503835 | 6.2% |
| Dash Punctuation | 909018 | 3.8% |
| Connector Punctuation | 566210 | 2.3% |
| Open Punctuation | 186529 | 0.8% |
| Close Punctuation | 186529 | 0.8% |
| Uppercase Letter | 5686 | < 0.1% |
| Math Symbol | 386 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 736138 | |
| d | 734755 | |
| i | 718822 | |
| a | 709722 | |
| r | 649804 | 8.7% |
| o | 564716 | 7.6% |
| m | 531546 | 7.1% |
| b | 425696 | 5.7% |
| c | 377479 | 5.1% |
| f | 376923 | 5.1% |
| Other values (8) | 1635224 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1306746 | |
| 4 | 1080313 | |
| 2 | 1005309 | |
| 9 | 877072 | |
| 8 | 856889 | |
| 3 | 854143 | |
| 7 | 850393 | |
| 6 | 823084 | |
| 0 | 786476 | |
| 5 | 747622 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 2266 | |
| P | 1140 | |
| M | 1140 | |
| U | 1126 | |
| A | 7 | 0.1% |
| R | 7 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 2587264 | |
| : | 847672 | 20.1% |
| , | 564716 | 13.4% |
| . | 208031 | 4.9% |
Space Separator
| Value | Count | Frequency (%) |
| 1503835 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 909018 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 566210 |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 186529 |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 186529 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 386 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16748237 | |
| Latin | 7466511 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 736138 | |
| d | 734755 | |
| i | 718822 | |
| a | 709722 | |
| r | 649804 | 8.7% |
| o | 564716 | 7.6% |
| m | 531546 | 7.1% |
| b | 425696 | 5.7% |
| c | 377479 | 5.1% |
| f | 376923 | 5.0% |
| Other values (14) | 1640910 |
Common
| Value | Count | Frequency (%) |
| " | 2587264 | |
| 1503835 | 9.0% | |
| 1 | 1306746 | 7.8% |
| 4 | 1080313 | 6.5% |
| 2 | 1005309 | 6.0% |
| - | 909018 | 5.4% |
| 9 | 877072 | 5.2% |
| 8 | 856889 | 5.1% |
| 3 | 854143 | 5.1% |
| 7 | 850393 | 5.1% |
| Other values (10) | 4917255 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24214748 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| " | 2587264 | 10.7% |
| 1503835 | 6.2% | |
| 1 | 1306746 | 5.4% |
| 4 | 1080313 | 4.5% |
| 2 | 1005309 | 4.2% |
| - | 909018 | 3.8% |
| 9 | 877072 | 3.6% |
| 8 | 856889 | 3.5% |
| 3 | 854143 | 3.5% |
| 7 | 850393 | 3.5% |
| Other values (34) | 12383766 |
occurrenceID
Text
Unique 
| Distinct | 186529 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 186529 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | urn:uuid:a15cbeaa-3fcd-4ec5-bfb1-27f0f8bc8910 |
|---|---|
| 2nd row | urn:uuid:a15e0d7e-5095-4a84-b02b-fe689f416389 |
| 3rd row | urn:uuid:a165d6f6-a6f1-4464-9d19-d307fba92359 |
| 4th row | urn:uuid:a1674501-cb24-4a3a-9ef8-4d0751ad4e63 |
| 5th row | urn:uuid:a169b221-8413-44a8-bccc-fa7045bf79df |
| Value | Count | Frequency (%) |
| urn:uuid:a15cbeaa-3fcd-4ec5-bfb1-27f0f8bc8910 | 1 | < 0.1% |
| urn:uuid:a1d1e7f6-c3fd-4cdf-92eb-181c3735610c | 1 | < 0.1% |
| urn:uuid:a19015bb-6550-4f6a-afda-a2f1f7015626 | 1 | < 0.1% |
| urn:uuid:a276dcf5-b6fd-4a0e-a9c9-e3d67d274f2c | 1 | < 0.1% |
| urn:uuid:a18d197d-f3bd-4416-bdef-f4a9f2135f3e | 1 | < 0.1% |
| urn:uuid:a165d6f6-a6f1-4464-9d19-d307fba92359 | 1 | < 0.1% |
| urn:uuid:a1674501-cb24-4a3a-9ef8-4d0751ad4e63 | 1 | < 0.1% |
| urn:uuid:a169b221-8413-44a8-bccc-fa7045bf79df | 1 | < 0.1% |
| urn:uuid:a16fdf5e-d4db-44ab-8f13-95359c948f0c | 1 | < 0.1% |
| urn:uuid:a17057c5-3a20-44d8-b8bb-a4febbcf747a | 1 | < 0.1% |
| Other values (186519) | 186519 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 746116 | 8.9% |
| u | 559587 | 6.7% |
| d | 536327 | 6.4% |
| 4 | 535641 | 6.4% |
| 8 | 397325 | 4.7% |
| a | 396721 | 4.7% |
| b | 396278 | 4.7% |
| 9 | 396248 | 4.7% |
| : | 373058 | 4.4% |
| c | 350587 | 4.2% |
| Other values (12) | 3705917 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3777062 | |
| Lowercase Letter | 3497569 | |
| Dash Punctuation | 746116 | 8.9% |
| Other Punctuation | 373058 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 559587 | |
| d | 536327 | |
| a | 396721 | |
| b | 396278 | |
| c | 350587 | |
| f | 349437 | |
| e | 349045 | |
| r | 186529 | 5.3% |
| i | 186529 | 5.3% |
| n | 186529 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 535641 | |
| 8 | 397325 | |
| 9 | 396248 | |
| 1 | 350371 | |
| 6 | 349908 | |
| 7 | 349825 | |
| 5 | 349699 | |
| 3 | 349432 | |
| 0 | 349369 | |
| 2 | 349244 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 746116 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 373058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4896236 | |
| Latin | 3497569 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 746116 | |
| 4 | 535641 | |
| 8 | 397325 | |
| 9 | 396248 | |
| : | 373058 | |
| 1 | 350371 | |
| 6 | 349908 | |
| 7 | 349825 | |
| 5 | 349699 | |
| 3 | 349432 | |
| Other values (2) | 698613 |
Latin
| Value | Count | Frequency (%) |
| u | 559587 | |
| d | 536327 | |
| a | 396721 | |
| b | 396278 | |
| c | 350587 | |
| f | 349437 | |
| e | 349045 | |
| r | 186529 | 5.3% |
| i | 186529 | 5.3% |
| n | 186529 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8393805 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 746116 | 8.9% |
| u | 559587 | 6.7% |
| d | 536327 | 6.4% |
| 4 | 535641 | 6.4% |
| 8 | 397325 | 4.7% |
| a | 396721 | 4.7% |
| b | 396278 | 4.7% |
| 9 | 396248 | 4.7% |
| : | 373058 | 4.4% |
| c | 350587 | 4.2% |
| Other values (12) | 3705917 |
catalogNumber
Text
Unique 
| Distinct | 186529 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.206482638 |
| Min length | 9 |
Unique
| Unique | 186529 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | YU.036650 |
|---|---|
| 2nd row | CBS.028950 |
| 3rd row | YU.070008 |
| 4th row | YU.204399 |
| 5th row | YU.175465 |
| Value | Count | Frequency (%) |
| yu.036650 | 1 | < 0.1% |
| yu.065082 | 1 | < 0.1% |
| yu.065678 | 1 | < 0.1% |
| yu.234842 | 1 | < 0.1% |
| yu.012442 | 1 | < 0.1% |
| yu.070008 | 1 | < 0.1% |
| yu.204399 | 1 | < 0.1% |
| yu.175465 | 1 | < 0.1% |
| yu.060443 | 1 | < 0.1% |
| yu.038995 | 1 | < 0.1% |
| Other values (186519) | 186519 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 223252 | |
| . | 186557 | |
| 2 | 150900 | |
| Y | 148126 | |
| U | 148126 | |
| 1 | 131446 | 7.7% |
| 3 | 102608 | 6.0% |
| 4 | 92196 | 5.4% |
| 5 | 86628 | 5.0% |
| 8 | 85549 | 5.0% |
| Other values (6) | 361888 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1119258 | |
| Uppercase Letter | 411461 | 24.0% |
| Other Punctuation | 186557 | 10.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 223252 | |
| 2 | 150900 | |
| 1 | 131446 | |
| 3 | 102608 | |
| 4 | 92196 | |
| 5 | 86628 | 7.7% |
| 8 | 85549 | 7.6% |
| 7 | 83994 | 7.5% |
| 6 | 83823 | 7.5% |
| 9 | 78862 | 7.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 148126 | |
| U | 148126 | |
| C | 38403 | 9.3% |
| B | 38403 | 9.3% |
| S | 38403 | 9.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 186557 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1305815 | |
| Latin | 411461 | 24.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 223252 | |
| . | 186557 | |
| 2 | 150900 | |
| 1 | 131446 | |
| 3 | 102608 | |
| 4 | 92196 | |
| 5 | 86628 | 6.6% |
| 8 | 85549 | 6.6% |
| 7 | 83994 | 6.4% |
| 6 | 83823 | 6.4% |
Latin
| Value | Count | Frequency (%) |
| Y | 148126 | |
| U | 148126 | |
| C | 38403 | 9.3% |
| B | 38403 | 9.3% |
| S | 38403 | 9.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1717276 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 223252 | |
| . | 186557 | |
| 2 | 150900 | |
| Y | 148126 | |
| U | 148126 | |
| 1 | 131446 | 7.7% |
| 3 | 102608 | 6.0% |
| 4 | 92196 | 5.4% |
| 5 | 86628 | 5.0% |
| 8 | 85549 | 5.0% |
| Other values (6) | 361888 |
recordNumber
Text
Missing 
| Distinct | 13601 |
|---|---|
| Distinct (%) | 28.6% |
| Missing | 139017 |
| Missing (%) | 74.5% |
| Memory size | 1.4 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 20 |
| Mean length | 3.446729247 |
| Min length | 1 |
Unique
| Unique | 7343 ? |
|---|---|
| Unique (%) | 15.5% |
Sample
| 1st row | 4856 |
|---|---|
| 2nd row | 621 |
| 3rd row | 12 |
| 4th row | 545 |
| 5th row | 4616 |
| Value | Count | Frequency (%) |
| 2 | 265 | 0.5% |
| 1 | 234 | 0.5% |
| 3 | 209 | 0.4% |
| 4 | 207 | 0.4% |
| 8 | 177 | 0.4% |
| 6 | 176 | 0.4% |
| 5 | 171 | 0.4% |
| 7 | 163 | 0.3% |
| 9 | 156 | 0.3% |
| 10 | 150 | 0.3% |
| Other values (12986) | 46388 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 25684 | |
| 2 | 19997 | |
| 3 | 17169 | |
| 4 | 15274 | |
| 5 | 14907 | |
| 6 | 13292 | |
| 7 | 12976 | |
| 8 | 12576 | |
| 0 | 12513 | |
| 9 | 12410 | |
| Other values (67) | 6963 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 156798 | |
| Lowercase Letter | 2347 | 1.4% |
| Uppercase Letter | 1734 | 1.1% |
| Other Punctuation | 1266 | 0.8% |
| Space Separator | 784 | 0.5% |
| Dash Punctuation | 744 | 0.5% |
| Math Symbol | 44 | < 0.1% |
| Open Punctuation | 22 | < 0.1% |
| Close Punctuation | 22 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 795 | |
| p | 631 | |
| b | 355 | |
| c | 92 | 3.9% |
| d | 77 | 3.3% |
| u | 63 | 2.7% |
| n | 62 | 2.6% |
| e | 49 | 2.1% |
| o | 32 | 1.4% |
| r | 26 | 1.1% |
| Other values (15) | 165 | 7.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 188 | |
| D | 177 | |
| X | 156 | 9.0% |
| B | 139 | 8.0% |
| I | 130 | 7.5% |
| A | 124 | 7.2% |
| P | 115 | 6.6% |
| C | 114 | 6.6% |
| E | 104 | 6.0% |
| W | 75 | 4.3% |
| Other values (15) | 412 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 25684 | |
| 2 | 19997 | |
| 3 | 17169 | |
| 4 | 15274 | |
| 5 | 14907 | |
| 6 | 13292 | |
| 7 | 12976 | |
| 8 | 12576 | |
| 0 | 12513 | |
| 9 | 12410 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 834 | |
| / | 221 | 17.5% |
| , | 138 | 10.9% |
| # | 35 | 2.8% |
| & | 16 | 1.3% |
| : | 10 | 0.8% |
| ? | 6 | 0.5% |
| ' | 5 | 0.4% |
| ; | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 22 | |
| + | 22 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 20 | |
| [ | 2 | 9.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 20 | |
| ] | 2 | 9.1% |
Space Separator
| Value | Count | Frequency (%) |
| 784 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 744 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 159680 | |
| Latin | 4081 | 2.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 795 | |
| p | 631 | |
| b | 355 | 8.7% |
| S | 188 | 4.6% |
| D | 177 | 4.3% |
| X | 156 | 3.8% |
| B | 139 | 3.4% |
| I | 130 | 3.2% |
| A | 124 | 3.0% |
| P | 115 | 2.8% |
| Other values (40) | 1271 |
Common
| Value | Count | Frequency (%) |
| 1 | 25684 | |
| 2 | 19997 | |
| 3 | 17169 | |
| 4 | 15274 | |
| 5 | 14907 | |
| 6 | 13292 | |
| 7 | 12976 | |
| 8 | 12576 | |
| 0 | 12513 | |
| 9 | 12410 | |
| Other values (17) | 2882 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 163761 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 25684 | |
| 2 | 19997 | |
| 3 | 17169 | |
| 4 | 15274 | |
| 5 | 14907 | |
| 6 | 13292 | |
| 7 | 12976 | |
| 8 | 12576 | |
| 0 | 12513 | |
| 9 | 12410 | |
| Other values (67) | 6963 | 4.3% |
recordedBy
Text
Missing 
| Distinct | 3451 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 75764 |
| Missing (%) | 40.6% |
| Memory size | 1.4 MiB |
Length
| Max length | 98 |
|---|---|
| Median length | 94 |
| Mean length | 16.95773033 |
| Min length | 2 |
Unique
| Unique | 1506 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | Charles H. Bissell |
|---|---|
| 2nd row | Horatio N. Fenn |
| 3rd row | Alfred H. Brinkman |
| 4th row | Charles C. Godfrey |
| 5th row | Charles H. Bissell |
| Value | Count | Frequency (%) |
| h | 17884 | 5.3% |
| charles | 16797 | 5.0% |
| w | 13815 | 4.1% |
| e | 13699 | 4.1% |
| a | 9233 | 2.8% |
| george | 9101 | 2.7% |
| bissell | 8948 | 2.7% |
| c | 7711 | 2.3% |
| nichols | 6625 | 2.0% |
| b | 6460 | 1.9% |
| Other values (2822) | 225265 |
Most occurring characters
| Value | Count | Frequency (%) |
| 224773 | 12.0% | |
| e | 165937 | 8.8% |
| r | 129288 | 6.9% |
| a | 119892 | 6.4% |
| l | 112246 | 6.0% |
| . | 107354 | 5.7% |
| n | 97287 | 5.2% |
| s | 80163 | 4.3% |
| i | 77500 | 4.1% |
| o | 75225 | 4.0% |
| Other values (70) | 688658 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1200369 | |
| Uppercase Letter | 335975 | 17.9% |
| Space Separator | 224773 | 12.0% |
| Other Punctuation | 116139 | 6.2% |
| Decimal Number | 799 | < 0.1% |
| Close Punctuation | 96 | < 0.1% |
| Open Punctuation | 96 | < 0.1% |
| Dash Punctuation | 76 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 165937 | |
| r | 129288 | |
| a | 119892 | |
| l | 112246 | |
| n | 97287 | |
| s | 80163 | 6.7% |
| i | 77500 | 6.5% |
| o | 75225 | 6.3% |
| h | 59764 | 5.0% |
| t | 52406 | 4.4% |
| Other values (21) | 230661 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 36558 | |
| E | 36255 | |
| H | 33567 | |
| A | 31547 | |
| W | 29410 | 8.8% |
| B | 28312 | 8.4% |
| S | 17816 | 5.3% |
| G | 17569 | 5.2% |
| J | 15225 | 4.5% |
| L | 13747 | 4.1% |
| Other values (17) | 75969 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 400 | |
| 9 | 184 | |
| 4 | 107 | 13.4% |
| 8 | 59 | 7.4% |
| 3 | 20 | 2.5% |
| 2 | 18 | 2.3% |
| 5 | 6 | 0.8% |
| 7 | 3 | 0.4% |
| 6 | 2 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 107354 | |
| , | 4807 | 4.1% |
| ; | 3914 | 3.4% |
| ' | 53 | < 0.1% |
| ? | 5 | < 0.1% |
| & | 4 | < 0.1% |
| / | 2 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 91 | |
| ] | 5 | 5.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 91 | |
| [ | 5 | 5.2% |
Space Separator
| Value | Count | Frequency (%) |
| 224773 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 76 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1536344 | |
| Common | 341979 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 165937 | 10.8% |
| r | 129288 | 8.4% |
| a | 119892 | 7.8% |
| l | 112246 | 7.3% |
| n | 97287 | 6.3% |
| s | 80163 | 5.2% |
| i | 77500 | 5.0% |
| o | 75225 | 4.9% |
| h | 59764 | 3.9% |
| t | 52406 | 3.4% |
| Other values (48) | 566636 |
Common
| Value | Count | Frequency (%) |
| 224773 | ||
| . | 107354 | |
| , | 4807 | 1.4% |
| ; | 3914 | 1.1% |
| 1 | 400 | 0.1% |
| 9 | 184 | 0.1% |
| 4 | 107 | < 0.1% |
| ) | 91 | < 0.1% |
| ( | 91 | < 0.1% |
| - | 76 | < 0.1% |
| Other values (12) | 182 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1878190 | |
| None | 133 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 224773 | 12.0% | |
| e | 165937 | 8.8% |
| r | 129288 | 6.9% |
| a | 119892 | 6.4% |
| l | 112246 | 6.0% |
| . | 107354 | 5.7% |
| n | 97287 | 5.2% |
| s | 80163 | 4.3% |
| i | 77500 | 4.1% |
| o | 75225 | 4.0% |
| Other values (64) | 688525 |
None
| Value | Count | Frequency (%) |
| á | 122 | |
| ö | 4 | 3.0% |
| ô | 4 | 3.0% |
| è | 1 | 0.8% |
| É | 1 | 0.8% |
| ä | 1 | 0.8% |
individualCount
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 186529 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 186529 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 186529 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 186529 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 186504 |
| Missing (%) | > 99.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 9 |
| Mean length | 10.28 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 4.0% |
Sample
| 1st row | Flowering |
|---|---|
| 2nd row | Flowering |
| 3rd row | Flowering |
| 4th row | Flowering & Fruiting. |
| 5th row | Fruiting |
| Value | Count | Frequency (%) |
| flowering | 20 | |
| fruiting | 6 | 18.8% |
| 2 | 6.2% | |
| male | 1 | 3.1% |
| and | 1 | 3.1% |
| female | 1 | 3.1% |
| cones | 1 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 32 | |
| n | 28 | |
| F | 26 | |
| r | 26 | |
| g | 26 | |
| e | 24 | |
| l | 22 | |
| o | 21 | |
| w | 20 | |
| 7 | 2.7% | |
| Other values (10) | 25 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 220 | |
| Uppercase Letter | 26 | 10.1% |
| Space Separator | 7 | 2.7% |
| Other Punctuation | 4 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 32 | |
| n | 28 | |
| r | 26 | |
| g | 26 | |
| e | 24 | |
| l | 22 | |
| o | 21 | |
| w | 20 | |
| t | 6 | 2.7% |
| u | 6 | 2.7% |
| Other values (6) | 9 | 4.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 2 | |
| . | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 26 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 246 | |
| Common | 11 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 32 | |
| n | 28 | |
| F | 26 | |
| r | 26 | |
| g | 26 | |
| e | 24 | |
| l | 22 | |
| o | 21 | |
| w | 20 | |
| t | 6 | 2.4% |
| Other values (7) | 15 |
Common
| Value | Count | Frequency (%) |
| 7 | ||
| & | 2 | 18.2% |
| . | 2 | 18.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 257 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 32 | |
| n | 28 | |
| F | 26 | |
| r | 26 | |
| g | 26 | |
| e | 24 | |
| l | 22 | |
| o | 21 | |
| w | 20 | |
| 7 | 2.7% | |
| Other values (10) | 25 |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 373058 | |
| P | 186529 | |
| R | 186529 | |
| S | 186529 | |
| N | 186529 | |
| T | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1305703 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 373058 | |
| P | 186529 | |
| R | 186529 | |
| S | 186529 | |
| N | 186529 | |
| T | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1305703 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 373058 | |
| P | 186529 | |
| R | 186529 | |
| S | 186529 | |
| N | 186529 | |
| T | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1305703 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 373058 | |
| P | 186529 | |
| R | 186529 | |
| S | 186529 | |
| N | 186529 | |
| T | 186529 |
preparations
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 186476 |
| Missing (%) | > 99.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 15 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | tissue (frozen) |
|---|---|
| 2nd row | tissue (frozen) |
| 3rd row | tissue (frozen) |
| 4th row | tissue (frozen) |
| 5th row | tissue (frozen) |
| Value | Count | Frequency (%) |
| tissue | 53 | |
| frozen | 53 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 106 | |
| e | 106 | |
| t | 53 | 6.7% |
| i | 53 | 6.7% |
| u | 53 | 6.7% |
| 53 | 6.7% | |
| ( | 53 | 6.7% |
| f | 53 | 6.7% |
| r | 53 | 6.7% |
| o | 53 | 6.7% |
| Other values (3) | 159 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 636 | |
| Space Separator | 53 | 6.7% |
| Open Punctuation | 53 | 6.7% |
| Close Punctuation | 53 | 6.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 106 | |
| e | 106 | |
| t | 53 | |
| i | 53 | |
| u | 53 | |
| f | 53 | |
| r | 53 | |
| o | 53 | |
| z | 53 | |
| n | 53 |
Space Separator
| Value | Count | Frequency (%) |
| 53 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 53 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 53 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 636 | |
| Common | 159 | 20.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 106 | |
| e | 106 | |
| t | 53 | |
| i | 53 | |
| u | 53 | |
| f | 53 | |
| r | 53 | |
| o | 53 | |
| z | 53 | |
| n | 53 |
Common
| Value | Count | Frequency (%) |
| 53 | ||
| ( | 53 | |
| ) | 53 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 795 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 106 | |
| e | 106 | |
| t | 53 | 6.7% |
| i | 53 | 6.7% |
| u | 53 | 6.7% |
| 53 | 6.7% | |
| ( | 53 | 6.7% |
| f | 53 | 6.7% |
| r | 53 | 6.7% |
| o | 53 | 6.7% |
| Other values (3) | 159 |
disposition
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | in collection |
|---|---|
| 2nd row | in collection |
| 3rd row | in collection |
| 4th row | in collection |
| 5th row | in collection |
| Value | Count | Frequency (%) |
| in | 186529 | |
| collection | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 373058 | |
| n | 373058 | |
| c | 373058 | |
| o | 373058 | |
| l | 373058 | |
| 186529 | ||
| e | 186529 | |
| t | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2238348 | |
| Space Separator | 186529 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 373058 | |
| n | 373058 | |
| c | 373058 | |
| o | 373058 | |
| l | 373058 | |
| e | 186529 | |
| t | 186529 |
Space Separator
| Value | Count | Frequency (%) |
| 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2238348 | |
| Common | 186529 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 373058 | |
| n | 373058 | |
| c | 373058 | |
| o | 373058 | |
| l | 373058 | |
| e | 186529 | |
| t | 186529 |
Common
| Value | Count | Frequency (%) |
| 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2424877 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 373058 | |
| n | 373058 | |
| c | 373058 | |
| o | 373058 | |
| l | 373058 | |
| 186529 | ||
| e | 186529 | |
| t | 186529 |
Missing 
| Distinct | 3765 |
|---|---|
| Distinct (%) | 37.4% |
| Missing | 176462 |
| Missing (%) | 94.6% |
| Memory size | 1.4 MiB |
Length
| Max length | 481 |
|---|---|
| Median length | 338 |
| Mean length | 43.67040826 |
| Min length | 1 |
Unique
| Unique | 3122 ? |
|---|---|
| Unique (%) | 31.0% |
Sample
| 1st row | Det. by: Martin C. Van Boskirk 1997| |
|---|---|
| 2nd row | Det. by: Alexander W. Evans |
| 3rd row | ISOTYPE. Note: Proc. Amer. Acad. Arts. 22: 420. 1887. |
| 4th row | ISOSYNTYPE. Note: Mem. Amer. Acad. Arts. n.s. 520. 1862. |
| 5th row | ISOTYPE. Note: Pl. Wright. (Grisebach) 1: 173. 1860. |
| Value | Count | Frequency (%) |
| by | 6513 | 8.7% |
| det | 6278 | 8.4% |
| note | 4081 | 5.5% |
| isotype | 2637 | 3.5% |
| of | 1965 | 2.6% |
| w | 1081 | 1.4% |
| the | 1033 | 1.4% |
| syntype | 884 | 1.2% |
| arts | 839 | 1.1% |
| amer | 784 | 1.0% |
| Other values (2983) | 48652 |
Most occurring characters
| Value | Count | Frequency (%) |
| 64680 | 14.7% | |
| . | 32693 | 7.4% |
| e | 30865 | 7.0% |
| t | 21244 | 4.8% |
| o | 17185 | 3.9% |
| a | 16258 | 3.7% |
| r | 16177 | 3.7% |
| : | 13805 | 3.1% |
| n | 13259 | 3.0% |
| i | 11346 | 2.6% |
| Other values (83) | 202118 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 202214 | |
| Uppercase Letter | 77380 | 17.6% |
| Space Separator | 64680 | 14.7% |
| Other Punctuation | 47613 | 10.8% |
| Decimal Number | 41766 | 9.5% |
| Math Symbol | 4292 | 1.0% |
| Dash Punctuation | 591 | 0.1% |
| Close Punctuation | 546 | 0.1% |
| Open Punctuation | 546 | 0.1% |
| Other Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 30865 | |
| t | 21244 | |
| o | 17185 | 8.5% |
| a | 16258 | 8.0% |
| r | 16177 | 8.0% |
| n | 13259 | 6.6% |
| i | 11346 | 5.6% |
| l | 10399 | 5.1% |
| y | 8787 | 4.3% |
| s | 8678 | 4.3% |
| Other values (24) | 48016 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 6983 | 9.0% |
| P | 6970 | 9.0% |
| E | 6536 | 8.4% |
| S | 6410 | 8.3% |
| A | 6193 | 8.0% |
| N | 6019 | 7.8% |
| Y | 5458 | 7.1% |
| T | 5403 | 7.0% |
| C | 3719 | 4.8% |
| O | 3662 | 4.7% |
| Other values (16) | 20027 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 32693 | |
| : | 13805 | |
| ; | 451 | 0.9% |
| , | 420 | 0.9% |
| ' | 92 | 0.2% |
| ? | 81 | 0.2% |
| & | 36 | 0.1% |
| " | 27 | 0.1% |
| # | 6 | < 0.1% |
| / | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 10167 | |
| 8 | 6112 | |
| 9 | 4977 | |
| 6 | 3913 | 9.4% |
| 2 | 3487 | 8.3% |
| 7 | 3041 | 7.3% |
| 5 | 3040 | 7.3% |
| 4 | 2459 | 5.9% |
| 3 | 2393 | 5.7% |
| 0 | 2177 | 5.2% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 4244 | |
| = | 44 | 1.0% |
| + | 4 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 380 | |
| ] | 164 | |
| } | 2 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 379 | |
| [ | 165 | |
| { | 2 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 563 | |
| – | 28 | 4.7% |
Space Separator
| Value | Count | Frequency (%) |
| 64680 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 279594 | |
| Common | 160036 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 30865 | 11.0% |
| t | 21244 | 7.6% |
| o | 17185 | 6.1% |
| a | 16258 | 5.8% |
| r | 16177 | 5.8% |
| n | 13259 | 4.7% |
| i | 11346 | 4.1% |
| l | 10399 | 3.7% |
| y | 8787 | 3.1% |
| s | 8678 | 3.1% |
| Other values (50) | 125396 |
Common
| Value | Count | Frequency (%) |
| 64680 | ||
| . | 32693 | |
| : | 13805 | 8.6% |
| 1 | 10167 | 6.4% |
| 8 | 6112 | 3.8% |
| 9 | 4977 | 3.1% |
| | | 4244 | 2.7% |
| 6 | 3913 | 2.4% |
| 2 | 3487 | 2.2% |
| 7 | 3041 | 1.9% |
| Other values (23) | 12917 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 439412 | |
| None | 190 | < 0.1% |
| Punctuation | 28 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 64680 | 14.7% | |
| . | 32693 | 7.4% |
| e | 30865 | 7.0% |
| t | 21244 | 4.8% |
| o | 17185 | 3.9% |
| a | 16258 | 3.7% |
| r | 16177 | 3.7% |
| : | 13805 | 3.1% |
| n | 13259 | 3.0% |
| i | 11346 | 2.6% |
| Other values (73) | 201900 |
None
| Value | Count | Frequency (%) |
| á | 125 | |
| ü | 26 | 13.7% |
| é | 23 | 12.1% |
| ö | 8 | 4.2% |
| ä | 2 | 1.1% |
| è | 2 | 1.1% |
| ° | 2 | 1.1% |
| ë | 1 | 0.5% |
| ñ | 1 | 0.5% |
Punctuation
| Value | Count | Frequency (%) |
| – | 28 |
associatedTaxa
Text
Missing 
| Distinct | 745 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 185782 |
| Missing (%) | 99.6% |
| Memory size | 1.4 MiB |
Length
| Max length | 109 |
|---|---|
| Median length | 21 |
| Mean length | 29.93574297 |
| Min length | 9 |
Unique
| Unique | 743 ? |
|---|---|
| Unique (%) | 99.5% |
Sample
| 1st row | same sheet: YU.064497|same sheet: YU.064498|same sheet: YU.064500 |
|---|---|
| 2nd row | same sheet: YU.064978 |
| 3rd row | YU.000992 |
| 4th row | same sheet: YU.064670 |
| 5th row | same sheet: YU.001167 |
| Value | Count | Frequency (%) |
| sheet | 965 | |
| same | 649 | |
| replicate | 9 | 0.3% |
| yu.065496|same | 5 | 0.2% |
| yu.014017|same | 5 | 0.2% |
| yu.014019|same | 5 | 0.2% |
| yu.014020|same | 5 | 0.2% |
| yu.014022 | 5 | 0.2% |
| yu.065492 | 5 | 0.2% |
| yu.065494|same | 5 | 0.2% |
| Other values (832) | 1037 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2925 | |
| 1948 | 8.7% | |
| s | 1930 | 8.6% |
| 0 | 1853 | 8.3% |
| 6 | 1270 | 5.7% |
| . | 1134 | 5.1% |
| Y | 1133 | 5.1% |
| U | 1126 | 5.0% |
| t | 983 | 4.4% |
| : | 983 | 4.4% |
| Other values (22) | 7077 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8823 | |
| Decimal Number | 6801 | |
| Uppercase Letter | 2287 | 10.2% |
| Other Punctuation | 2117 | 9.5% |
| Space Separator | 1948 | 8.7% |
| Math Symbol | 386 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2925 | |
| s | 1930 | |
| t | 983 | 11.1% |
| a | 977 | 11.1% |
| h | 971 | 11.0% |
| m | 965 | 10.9% |
| r | 18 | 0.2% |
| p | 12 | 0.1% |
| c | 12 | 0.1% |
| i | 12 | 0.1% |
| Other values (2) | 18 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1853 | |
| 6 | 1270 | |
| 5 | 693 | 10.2% |
| 1 | 596 | 8.8% |
| 4 | 587 | 8.6% |
| 2 | 450 | 6.6% |
| 9 | 375 | 5.5% |
| 7 | 364 | 5.4% |
| 3 | 322 | 4.7% |
| 8 | 291 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 1133 | |
| U | 1126 | |
| A | 7 | 0.3% |
| P | 7 | 0.3% |
| R | 7 | 0.3% |
| M | 7 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1134 | |
| : | 983 |
Space Separator
| Value | Count | Frequency (%) |
| 1948 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 386 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11252 | |
| Latin | 11110 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2925 | |
| s | 1930 | |
| Y | 1133 | 10.2% |
| U | 1126 | 10.1% |
| t | 983 | 8.8% |
| a | 977 | 8.8% |
| h | 971 | 8.7% |
| m | 965 | 8.7% |
| r | 18 | 0.2% |
| p | 12 | 0.1% |
| Other values (8) | 70 | 0.6% |
Common
| Value | Count | Frequency (%) |
| 1948 | ||
| 0 | 1853 | |
| 6 | 1270 | |
| . | 1134 | |
| : | 983 | |
| 5 | 693 | 6.2% |
| 1 | 596 | 5.3% |
| 4 | 587 | 5.2% |
| 2 | 450 | 4.0% |
| | | 386 | 3.4% |
| Other values (4) | 1352 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22362 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2925 | |
| 1948 | 8.7% | |
| s | 1930 | 8.6% |
| 0 | 1853 | 8.3% |
| 6 | 1270 | 5.7% |
| . | 1134 | 5.1% |
| Y | 1133 | 5.1% |
| U | 1126 | 5.0% |
| t | 983 | 4.4% |
| : | 983 | 4.4% |
| Other values (22) | 7077 |
| Distinct | 186516 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 509 |
|---|---|
| Median length | 29 |
| Mean length | 32.71075811 |
| Min length | 24 |
Unique
| Unique | 186503 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | YU number 36650; lot count 1 |
|---|---|
| 2nd row | CBS number 28950; lot count 1 |
| 3rd row | YU number 70008; lot count 1 |
| 4th row | YU number 204399; lot count 1 |
| 5th row | YU number 175465; lot count 1 |
| Value | Count | Frequency (%) |
| 1 | 186654 | |
| number | 186532 | |
| lot | 186530 | |
| count | 186529 | |
| yu | 148138 | |
| cbs | 38404 | 3.2% |
| tall | 1591 | 0.1% |
| dryopteris | 1419 | 0.1% |
| ca | 1393 | 0.1% |
| carex | 1306 | 0.1% |
| Other values (156716) | 268264 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1020231 | ||
| o | 410162 | 6.7% |
| t | 410146 | 6.7% |
| n | 405993 | 6.7% |
| u | 403292 | 6.6% |
| 1 | 323313 | 5.3% |
| e | 243622 | 4.0% |
| r | 227752 | 3.7% |
| l | 226680 | 3.7% |
| ; | 215837 | 3.5% |
| Other values (79) | 2214477 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3196751 | |
| Decimal Number | 1190785 | 19.5% |
| Space Separator | 1020231 | 16.7% |
| Uppercase Letter | 457401 | 7.5% |
| Other Punctuation | 232829 | 3.8% |
| Math Symbol | 2789 | < 0.1% |
| Dash Punctuation | 563 | < 0.1% |
| Close Punctuation | 64 | < 0.1% |
| Open Punctuation | 64 | < 0.1% |
| Connector Punctuation | 27 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 410162 | |
| t | 410146 | |
| n | 405993 | |
| u | 403292 | |
| e | 243622 | |
| r | 227752 | |
| l | 226680 | |
| c | 212563 | |
| m | 212319 | |
| b | 193316 | |
| Other values (18) | 250906 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 151690 | |
| U | 149387 | |
| C | 43689 | 9.6% |
| S | 40747 | 8.9% |
| B | 39685 | 8.7% |
| P | 6322 | 1.4% |
| A | 4630 | 1.0% |
| D | 3750 | 0.8% |
| M | 3438 | 0.8% |
| H | 2160 | 0.5% |
| Other values (16) | 11903 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 215837 | |
| . | 11916 | 5.1% |
| , | 3123 | 1.3% |
| : | 1459 | 0.6% |
| & | 336 | 0.1% |
| / | 76 | < 0.1% |
| ' | 40 | < 0.1% |
| " | 24 | < 0.1% |
| % | 7 | < 0.1% |
| ? | 7 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 323313 | |
| 2 | 153590 | |
| 3 | 104319 | 8.8% |
| 4 | 93865 | 7.9% |
| 0 | 89929 | 7.6% |
| 5 | 89211 | 7.5% |
| 8 | 86892 | 7.3% |
| 6 | 85406 | 7.2% |
| 7 | 84783 | 7.1% |
| 9 | 79477 | 6.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 2773 | |
| ~ | 5 | 0.2% |
| < | 4 | 0.1% |
| + | 4 | 0.1% |
| > | 3 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 562 | |
| – | 1 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 63 | |
| ] | 1 | 1.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 63 | |
| [ | 1 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 1020231 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 27 |
Other Number
| Value | Count | Frequency (%) |
| ₂ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3654152 | |
| Common | 2447353 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 410162 | |
| t | 410146 | |
| n | 405993 | |
| u | 403292 | |
| e | 243622 | 6.7% |
| r | 227752 | 6.2% |
| l | 226680 | 6.2% |
| c | 212563 | 5.8% |
| m | 212319 | 5.8% |
| b | 193316 | 5.3% |
| Other values (44) | 708307 |
Common
| Value | Count | Frequency (%) |
| 1020231 | ||
| 1 | 323313 | 13.2% |
| ; | 215837 | 8.8% |
| 2 | 153590 | 6.3% |
| 3 | 104319 | 4.3% |
| 4 | 93865 | 3.8% |
| 0 | 89929 | 3.7% |
| 5 | 89211 | 3.6% |
| 8 | 86892 | 3.6% |
| 6 | 85406 | 3.5% |
| Other values (25) | 184760 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6101443 | |
| None | 61 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1020231 | ||
| o | 410162 | 6.7% |
| t | 410146 | 6.7% |
| n | 405993 | 6.7% |
| u | 403292 | 6.6% |
| 1 | 323313 | 5.3% |
| e | 243622 | 4.0% |
| r | 227752 | 3.7% |
| l | 226680 | 3.7% |
| ; | 215837 | 3.5% |
| Other values (75) | 2214415 |
None
| Value | Count | Frequency (%) |
| á | 30 | |
| ñ | 30 | |
| ₂ | 1 | 1.6% |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 |
| Distinct | 17165 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 172 |
|---|---|
| Median length | 139 |
| Mean length | 16.42291339 |
| Min length | 3 |
Unique
| Unique | 8494 ? |
|---|---|
| Unique (%) | 4.6% |
Sample
| 1st row | Luzula bulbosa |
|---|---|
| 2nd row | Gentiana clausa |
| 3rd row | Carex muhlenbergii|Carex muhlenbergii |
| 4th row | Lophocolea minor |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 28374 | 8.5% |
| carex | 8803 | 2.6% |
| var | 4014 | 1.2% |
| dryopteris | 2392 | 0.7% |
| sphagnum | 2360 | 0.7% |
| juncus | 1814 | 0.5% |
| frullania | 1708 | 0.5% |
| asplenium | 1557 | 0.5% |
| scapania | 1517 | 0.5% |
| canadensis | 1511 | 0.5% |
| Other values (14275) | 280732 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 399621 | |
| i | 270647 | 8.8% |
| e | 211263 | 6.9% |
| l | 201909 | 6.6% |
| r | 180523 | 5.9% |
| n | 175073 | 5.7% |
| u | 167028 | 5.5% |
| o | 161624 | 5.3% |
| s | 159635 | 5.2% |
| t | 149512 | 4.9% |
| Other values (49) | 986219 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2714576 | |
| Uppercase Letter | 190758 | 6.2% |
| Space Separator | 148271 | 4.8% |
| Other Punctuation | 4477 | 0.1% |
| Math Symbol | 4244 | 0.1% |
| Dash Punctuation | 726 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 399621 | |
| i | 270647 | |
| e | 211263 | 7.8% |
| l | 201909 | 7.4% |
| r | 180523 | 6.7% |
| n | 175073 | 6.4% |
| u | 167028 | 6.2% |
| o | 161624 | 6.0% |
| s | 159635 | 5.9% |
| t | 149512 | 5.5% |
| Other values (16) | 637741 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 49853 | |
| C | 26611 | |
| S | 17407 | 9.1% |
| A | 14516 | 7.6% |
| L | 11096 | 5.8% |
| D | 7958 | 4.2% |
| R | 7175 | 3.8% |
| E | 7079 | 3.7% |
| B | 6558 | 3.4% |
| M | 6188 | 3.2% |
| Other values (16) | 36317 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4475 | |
| ? | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 148271 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 4244 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 726 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2905334 | |
| Common | 157720 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 399621 | |
| i | 270647 | 9.3% |
| e | 211263 | 7.3% |
| l | 201909 | 6.9% |
| r | 180523 | 6.2% |
| n | 175073 | 6.0% |
| u | 167028 | 5.7% |
| o | 161624 | 5.6% |
| s | 159635 | 5.5% |
| t | 149512 | 5.1% |
| Other values (42) | 828499 |
Common
| Value | Count | Frequency (%) |
| 148271 | ||
| . | 4475 | 2.8% |
| | | 4244 | 2.7% |
| - | 726 | 0.5% |
| ? | 2 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3063054 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 399621 | |
| i | 270647 | 8.8% |
| e | 211263 | 6.9% |
| l | 201909 | 6.6% |
| r | 180523 | 5.9% |
| n | 175073 | 5.7% |
| u | 167028 | 5.5% |
| o | 161624 | 5.3% |
| s | 159635 | 5.2% |
| t | 149512 | 4.9% |
| Other values (49) | 986219 |
eventDate
Text
Missing 
| Distinct | 19106 |
|---|---|
| Distinct (%) | 18.6% |
| Missing | 84019 |
| Missing (%) | 45.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 9.320905278 |
| Min length | 4 |
Unique
| Unique | 7461 ? |
|---|---|
| Unique (%) | 7.3% |
Sample
| 1st row | 1919-10-01 |
|---|---|
| 2nd row | 1822 |
| 3rd row | 1909-05-27 |
| 4th row | 1905-07-23 |
| 5th row | 1901-09-02 |
| Value | Count | Frequency (%) |
| 1822 | 660 | 0.6% |
| 1920 | 497 | 0.5% |
| 1914 | 302 | 0.3% |
| 1875 | 288 | 0.3% |
| 1893 | 280 | 0.3% |
| 1902-08-20/1902-08-25 | 228 | 0.2% |
| 1859 | 225 | 0.2% |
| 1876 | 213 | 0.2% |
| 1915 | 208 | 0.2% |
| 1862 | 205 | 0.2% |
| Other values (19096) | 99404 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 179522 | |
| 1 | 178315 | |
| 0 | 166698 | |
| 9 | 118068 | |
| 8 | 72267 | |
| 2 | 64463 | 6.7% |
| 7 | 42922 | 4.5% |
| 6 | 36805 | 3.9% |
| 3 | 35566 | 3.7% |
| 5 | 34423 | 3.6% |
| Other values (2) | 26437 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 774588 | |
| Dash Punctuation | 179522 | 18.8% |
| Other Punctuation | 1376 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 178315 | |
| 0 | 166698 | |
| 9 | 118068 | |
| 8 | 72267 | |
| 2 | 64463 | 8.3% |
| 7 | 42922 | 5.5% |
| 6 | 36805 | 4.8% |
| 3 | 35566 | 4.6% |
| 5 | 34423 | 4.4% |
| 4 | 25061 | 3.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 179522 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1376 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 955486 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 179522 | |
| 1 | 178315 | |
| 0 | 166698 | |
| 9 | 118068 | |
| 8 | 72267 | |
| 2 | 64463 | 6.7% |
| 7 | 42922 | 4.5% |
| 6 | 36805 | 3.9% |
| 3 | 35566 | 3.7% |
| 5 | 34423 | 3.6% |
| Other values (2) | 26437 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 955486 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 179522 | |
| 1 | 178315 | |
| 0 | 166698 | |
| 9 | 118068 | |
| 8 | 72267 | |
| 2 | 64463 | 6.7% |
| 7 | 42922 | 4.5% |
| 6 | 36805 | 3.9% |
| 3 | 35566 | 3.7% |
| 5 | 34423 | 3.6% |
| Other values (2) | 26437 | 2.8% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 103374 |
| Missing (%) | 55.4% |
| Memory size | 1.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.925536648 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 274 |
|---|---|
| 2nd row | 147 |
| 3rd row | 204 |
| 4th row | 245 |
| 5th row | 175 |
| Value | Count | Frequency (%) |
| 232 | 837 | 1.0% |
| 150 | 753 | 0.9% |
| 201 | 751 | 0.9% |
| 186 | 717 | 0.9% |
| 249 | 701 | 0.8% |
| 185 | 700 | 0.8% |
| 200 | 669 | 0.8% |
| 193 | 651 | 0.8% |
| 172 | 624 | 0.8% |
| 151 | 607 | 0.7% |
| Other values (356) | 76145 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 55350 | |
| 1 | 51427 | |
| 3 | 20962 | 8.6% |
| 5 | 18239 | 7.5% |
| 6 | 17248 | 7.1% |
| 4 | 17190 | 7.1% |
| 0 | 16075 | 6.6% |
| 9 | 15802 | 6.5% |
| 8 | 15598 | 6.4% |
| 7 | 15382 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 243273 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 55350 | |
| 1 | 51427 | |
| 3 | 20962 | 8.6% |
| 5 | 18239 | 7.5% |
| 6 | 17248 | 7.1% |
| 4 | 17190 | 7.1% |
| 0 | 16075 | 6.6% |
| 9 | 15802 | 6.5% |
| 8 | 15598 | 6.4% |
| 7 | 15382 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 243273 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 55350 | |
| 1 | 51427 | |
| 3 | 20962 | 8.6% |
| 5 | 18239 | 7.5% |
| 6 | 17248 | 7.1% |
| 4 | 17190 | 7.1% |
| 0 | 16075 | 6.6% |
| 9 | 15802 | 6.5% |
| 8 | 15598 | 6.4% |
| 7 | 15382 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 243273 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 55350 | |
| 1 | 51427 | |
| 3 | 20962 | 8.6% |
| 5 | 18239 | 7.5% |
| 6 | 17248 | 7.1% |
| 4 | 17190 | 7.1% |
| 0 | 16075 | 6.6% |
| 9 | 15802 | 6.5% |
| 8 | 15598 | 6.4% |
| 7 | 15382 | 6.3% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 103374 |
| Missing (%) | 55.4% |
| Memory size | 1.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.925717034 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 274 |
|---|---|
| 2nd row | 147 |
| 3rd row | 204 |
| 4th row | 245 |
| 5th row | 175 |
| Value | Count | Frequency (%) |
| 201 | 752 | 0.9% |
| 150 | 742 | 0.9% |
| 249 | 720 | 0.9% |
| 186 | 718 | 0.9% |
| 237 | 689 | 0.8% |
| 185 | 688 | 0.8% |
| 200 | 677 | 0.8% |
| 193 | 670 | 0.8% |
| 172 | 626 | 0.8% |
| 232 | 608 | 0.7% |
| Other values (356) | 76265 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 55269 | |
| 1 | 51209 | |
| 3 | 20925 | 8.6% |
| 5 | 18234 | 7.5% |
| 6 | 17278 | 7.1% |
| 4 | 17165 | 7.1% |
| 0 | 16024 | 6.6% |
| 9 | 15862 | 6.5% |
| 7 | 15690 | 6.4% |
| 8 | 15632 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 243288 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 55269 | |
| 1 | 51209 | |
| 3 | 20925 | 8.6% |
| 5 | 18234 | 7.5% |
| 6 | 17278 | 7.1% |
| 4 | 17165 | 7.1% |
| 0 | 16024 | 6.6% |
| 9 | 15862 | 6.5% |
| 7 | 15690 | 6.4% |
| 8 | 15632 | 6.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 243288 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 55269 | |
| 1 | 51209 | |
| 3 | 20925 | 8.6% |
| 5 | 18234 | 7.5% |
| 6 | 17278 | 7.1% |
| 4 | 17165 | 7.1% |
| 0 | 16024 | 6.6% |
| 9 | 15862 | 6.5% |
| 7 | 15690 | 6.4% |
| 8 | 15632 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 243288 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 55269 | |
| 1 | 51209 | |
| 3 | 20925 | 8.6% |
| 5 | 18234 | 7.5% |
| 6 | 17278 | 7.1% |
| 4 | 17165 | 7.1% |
| 0 | 16024 | 6.6% |
| 9 | 15862 | 6.5% |
| 7 | 15690 | 6.4% |
| 8 | 15632 | 6.4% |
year
Text
Missing 
| Distinct | 206 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 84248 |
| Missing (%) | 45.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1919 |
|---|---|
| 2nd row | 1822 |
| 3rd row | 1909 |
| 4th row | 1905 |
| 5th row | 1901 |
| Value | Count | Frequency (%) |
| 1903 | 3557 | 3.5% |
| 1908 | 3427 | 3.4% |
| 1906 | 3121 | 3.1% |
| 1909 | 3107 | 3.0% |
| 1907 | 2461 | 2.4% |
| 1905 | 2453 | 2.4% |
| 1902 | 2442 | 2.4% |
| 1904 | 2252 | 2.2% |
| 1901 | 2247 | 2.2% |
| 1910 | 2160 | 2.1% |
| Other values (196) | 75054 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 125652 | |
| 9 | 95339 | |
| 8 | 44654 | 10.9% |
| 0 | 40689 | 9.9% |
| 2 | 25566 | 6.2% |
| 3 | 20238 | 4.9% |
| 7 | 16894 | 4.1% |
| 5 | 14797 | 3.6% |
| 6 | 13450 | 3.3% |
| 4 | 11845 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 409124 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 125652 | |
| 9 | 95339 | |
| 8 | 44654 | 10.9% |
| 0 | 40689 | 9.9% |
| 2 | 25566 | 6.2% |
| 3 | 20238 | 4.9% |
| 7 | 16894 | 4.1% |
| 5 | 14797 | 3.6% |
| 6 | 13450 | 3.3% |
| 4 | 11845 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 409124 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 125652 | |
| 9 | 95339 | |
| 8 | 44654 | 10.9% |
| 0 | 40689 | 9.9% |
| 2 | 25566 | 6.2% |
| 3 | 20238 | 4.9% |
| 7 | 16894 | 4.1% |
| 5 | 14797 | 3.6% |
| 6 | 13450 | 3.3% |
| 4 | 11845 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 409124 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 125652 | |
| 9 | 95339 | |
| 8 | 44654 | 10.9% |
| 0 | 40689 | 9.9% |
| 2 | 25566 | 6.2% |
| 3 | 20238 | 4.9% |
| 7 | 16894 | 4.1% |
| 5 | 14797 | 3.6% |
| 6 | 13450 | 3.3% |
| 4 | 11845 | 2.9% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 93636 |
| Missing (%) | 50.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.091094054 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10 |
|---|---|
| 2nd row | 5 |
| 3rd row | 7 |
| 4th row | 9 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 8 | 18299 | |
| 7 | 17323 | |
| 6 | 14919 | |
| 9 | 12921 | |
| 5 | 10473 | |
| 10 | 5121 | 5.5% |
| 4 | 4638 | 5.0% |
| 3 | 2657 | 2.9% |
| 11 | 2066 | 2.2% |
| 2 | 1834 | 2.0% |
| Other values (2) | 2642 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 18299 | |
| 7 | 17323 | |
| 6 | 14919 | |
| 9 | 12921 | |
| 1 | 11895 | |
| 5 | 10473 | |
| 0 | 5121 | 5.1% |
| 4 | 4638 | 4.6% |
| 2 | 3109 | 3.1% |
| 3 | 2657 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 101355 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 18299 | |
| 7 | 17323 | |
| 6 | 14919 | |
| 9 | 12921 | |
| 1 | 11895 | |
| 5 | 10473 | |
| 0 | 5121 | 5.1% |
| 4 | 4638 | 4.6% |
| 2 | 3109 | 3.1% |
| 3 | 2657 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 101355 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 18299 | |
| 7 | 17323 | |
| 6 | 14919 | |
| 9 | 12921 | |
| 1 | 11895 | |
| 5 | 10473 | |
| 0 | 5121 | 5.1% |
| 4 | 4638 | 4.6% |
| 2 | 3109 | 3.1% |
| 3 | 2657 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 101355 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 18299 | |
| 7 | 17323 | |
| 6 | 14919 | |
| 9 | 12921 | |
| 1 | 11895 | |
| 5 | 10473 | |
| 0 | 5121 | 5.1% |
| 4 | 4638 | 4.6% |
| 2 | 3109 | 3.1% |
| 3 | 2657 | 2.6% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 104750 |
| Missing (%) | 56.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.717934922 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 27 |
| 3rd row | 23 |
| 4th row | 2 |
| 5th row | 24 |
| Value | Count | Frequency (%) |
| 20 | 3304 | 4.0% |
| 12 | 3096 | 3.8% |
| 30 | 3014 | 3.7% |
| 10 | 2927 | 3.6% |
| 19 | 2906 | 3.6% |
| 15 | 2903 | 3.5% |
| 17 | 2855 | 3.5% |
| 13 | 2816 | 3.4% |
| 8 | 2799 | 3.4% |
| 4 | 2741 | 3.4% |
| Other values (21) | 52418 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 37526 | |
| 2 | 33938 | |
| 3 | 12087 | 8.6% |
| 0 | 9245 | 6.6% |
| 5 | 8118 | 5.8% |
| 4 | 8099 | 5.8% |
| 7 | 8096 | 5.8% |
| 8 | 7960 | 5.7% |
| 9 | 7762 | 5.5% |
| 6 | 7660 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 140491 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 37526 | |
| 2 | 33938 | |
| 3 | 12087 | 8.6% |
| 0 | 9245 | 6.6% |
| 5 | 8118 | 5.8% |
| 4 | 8099 | 5.8% |
| 7 | 8096 | 5.8% |
| 8 | 7960 | 5.7% |
| 9 | 7762 | 5.5% |
| 6 | 7660 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 140491 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 37526 | |
| 2 | 33938 | |
| 3 | 12087 | 8.6% |
| 0 | 9245 | 6.6% |
| 5 | 8118 | 5.8% |
| 4 | 8099 | 5.8% |
| 7 | 8096 | 5.8% |
| 8 | 7960 | 5.7% |
| 9 | 7762 | 5.5% |
| 6 | 7660 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 140491 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 37526 | |
| 2 | 33938 | |
| 3 | 12087 | 8.6% |
| 0 | 9245 | 6.6% |
| 5 | 8118 | 5.8% |
| 4 | 8099 | 5.8% |
| 7 | 8096 | 5.8% |
| 8 | 7960 | 5.7% |
| 9 | 7762 | 5.5% |
| 6 | 7660 | 5.5% |
habitat
Text
Missing 
| Distinct | 14351 |
|---|---|
| Distinct (%) | 49.8% |
| Missing | 157729 |
| Missing (%) | 84.6% |
| Memory size | 1.4 MiB |
Length
| Max length | 242 |
|---|---|
| Median length | 188 |
| Mean length | 21.31243056 |
| Min length | 3 |
Unique
| Unique | 11617 ? |
|---|---|
| Unique (%) | 40.3% |
Sample
| 1st row | Earth |
|---|---|
| 2nd row | Edge of lake; Moist soil |
| 3rd row | On high cliff |
| 4th row | Primary montane forest. |
| 5th row | Sur les arbres (on the trees) |
| Value | Count | Frequency (%) |
| on | 15468 | 13.4% |
| in | 6301 | 5.5% |
| of | 4480 | 3.9% |
| rocks | 4154 | 3.6% |
| a | 1945 | 1.7% |
| woods | 1920 | 1.7% |
| wet | 1736 | 1.5% |
| trees | 1636 | 1.4% |
| and | 1457 | 1.3% |
| tree | 1397 | 1.2% |
| Other values (4244) | 74614 |
Most occurring characters
| Value | Count | Frequency (%) |
| 86308 | ||
| e | 49629 | 8.1% |
| o | 47740 | 7.8% |
| n | 47247 | 7.7% |
| a | 38799 | 6.3% |
| s | 38696 | 6.3% |
| r | 36877 | 6.0% |
| t | 27255 | 4.4% |
| i | 24024 | 3.9% |
| d | 23439 | 3.8% |
| Other values (86) | 193784 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 476716 | |
| Space Separator | 86308 | 14.1% |
| Uppercase Letter | 35456 | 5.8% |
| Other Punctuation | 12410 | 2.0% |
| Dash Punctuation | 868 | 0.1% |
| Close Punctuation | 824 | 0.1% |
| Open Punctuation | 816 | 0.1% |
| Decimal Number | 335 | 0.1% |
| Math Symbol | 43 | < 0.1% |
| Currency Symbol | 11 | < 0.1% |
| Other values (3) | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 49629 | |
| o | 47740 | 10.0% |
| n | 47247 | 9.9% |
| a | 38799 | 8.1% |
| s | 38696 | 8.1% |
| r | 36877 | 7.7% |
| t | 27255 | 5.7% |
| i | 24024 | 5.0% |
| d | 23439 | 4.9% |
| l | 20507 | 4.3% |
| Other values (17) | 122503 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 14615 | |
| I | 2748 | 7.8% |
| S | 2363 | 6.7% |
| B | 2146 | 6.1% |
| R | 1640 | 4.6% |
| A | 1512 | 4.3% |
| C | 1306 | 3.7% |
| W | 1225 | 3.5% |
| M | 1196 | 3.4% |
| D | 910 | 2.6% |
| Other values (17) | 5795 | 16.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5273 | |
| , | 3827 | |
| ; | 2610 | |
| / | 316 | 2.5% |
| & | 127 | 1.0% |
| ? | 91 | 0.7% |
| " | 73 | 0.6% |
| ' | 54 | 0.4% |
| : | 34 | 0.3% |
| ¡ | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 67 | |
| 2 | 55 | |
| 0 | 55 | |
| 3 | 52 | |
| 4 | 32 | |
| 6 | 20 | 6.0% |
| 5 | 20 | 6.0% |
| 9 | 15 | 4.5% |
| 8 | 13 | 3.9% |
| 7 | 6 | 1.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 796 | |
| ‚ | 10 | 1.2% |
| [ | 9 | 1.1% |
| { | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 38 | |
| = | 4 | 9.3% |
| < | 1 | 2.3% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 6 | |
| ¢ | 3 | |
| £ | 2 | 18.2% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 3 | |
| “ | 2 | |
| ‹ | 1 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 866 | |
| — | 2 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 814 | |
| ] | 10 | 1.2% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 3 | |
| ’ | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 86308 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˆ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 512172 | |
| Common | 101626 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 49629 | 9.7% |
| o | 47740 | 9.3% |
| n | 47247 | 9.2% |
| a | 38799 | 7.6% |
| s | 38696 | 7.6% |
| r | 36877 | 7.2% |
| t | 27255 | 5.3% |
| i | 24024 | 4.7% |
| d | 23439 | 4.6% |
| l | 20507 | 4.0% |
| Other values (44) | 157959 |
Common
| Value | Count | Frequency (%) |
| 86308 | ||
| . | 5273 | 5.2% |
| , | 3827 | 3.8% |
| ; | 2610 | 2.6% |
| - | 866 | 0.9% |
| ) | 814 | 0.8% |
| ( | 796 | 0.8% |
| / | 316 | 0.3% |
| & | 127 | 0.1% |
| ? | 91 | 0.1% |
| Other values (32) | 598 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 613755 | |
| Punctuation | 22 | < 0.1% |
| None | 20 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 86308 | ||
| e | 49629 | 8.1% |
| o | 47740 | 7.8% |
| n | 47247 | 7.7% |
| a | 38799 | 6.3% |
| s | 38696 | 6.3% |
| r | 36877 | 6.0% |
| t | 27255 | 4.4% |
| i | 24024 | 3.9% |
| d | 23439 | 3.8% |
| Other values (72) | 193741 |
Punctuation
| Value | Count | Frequency (%) |
| ‚ | 10 | |
| ” | 3 | 13.6% |
| ‘ | 3 | 13.6% |
| “ | 2 | 9.1% |
| — | 2 | 9.1% |
| ‹ | 1 | 4.5% |
| ’ | 1 | 4.5% |
None
| Value | Count | Frequency (%) |
| ¤ | 6 | |
| Š | 3 | |
| ¡ | 3 | |
| ø | 3 | |
| ¢ | 3 | |
| £ | 2 | 10.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ˆ | 1 |
higherGeography
Text
Missing 
| Distinct | 3946 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 72099 |
| Missing (%) | 38.7% |
| Memory size | 1.4 MiB |
Length
| Max length | 100 |
|---|---|
| Median length | 90 |
| Mean length | 51.15013545 |
| Min length | 4 |
Unique
| Unique | 1598 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | North America; USA; Connecticut; New London County; Salem |
|---|---|
| 2nd row | North America; USA; Connecticut; New Haven County; New Haven |
| 3rd row | North America; Canada; British Columbia |
| 4th row | North America; USA; Connecticut; Litchfield County; Washington |
| 5th row | North America; USA; Connecticut; Hartford County; Southington |
| Value | Count | Frequency (%) |
| north | 111995 | |
| america | 109184 | |
| usa | 99054 | |
| county | 86823 | |
| connecticut | 62098 | 8.0% |
| new | 41413 | 5.3% |
| haven | 29950 | 3.9% |
| hartford | 12411 | 1.6% |
| litchfield | 10261 | 1.3% |
| fairfield | 7167 | 0.9% |
| Other values (2947) | 204171 |
Most occurring characters
| Value | Count | Frequency (%) |
| 660097 | 11.3% | |
| t | 419182 | 7.2% |
| o | 390984 | 6.7% |
| ; | 389920 | 6.7% |
| n | 368644 | 6.3% |
| e | 355954 | 6.1% |
| r | 341889 | 5.8% |
| a | 317910 | 5.4% |
| i | 314775 | 5.4% |
| c | 274503 | 4.7% |
| Other values (58) | 2019252 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3828543 | |
| Uppercase Letter | 972588 | 16.6% |
| Space Separator | 660097 | 11.3% |
| Other Punctuation | 390992 | 6.7% |
| Dash Punctuation | 882 | < 0.1% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 419182 | |
| o | 390984 | |
| n | 368644 | |
| e | 355954 | |
| r | 341889 | |
| a | 317910 | |
| i | 314775 | |
| c | 274503 | 7.2% |
| u | 191407 | 5.0% |
| h | 158690 | 4.1% |
| Other values (22) | 694605 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 217147 | |
| C | 174483 | |
| N | 158609 | |
| S | 122261 | |
| U | 100685 | |
| H | 51363 | 5.3% |
| M | 24299 | 2.5% |
| L | 22479 | 2.3% |
| W | 14997 | 1.5% |
| F | 14090 | 1.4% |
| Other values (16) | 72175 | 7.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 389920 | |
| ' | 560 | 0.1% |
| . | 316 | 0.1% |
| & | 196 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 2 | |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 2 | |
| ) | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 660097 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 882 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4801131 | |
| Common | 1051979 | 18.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 419182 | 8.7% |
| o | 390984 | 8.1% |
| n | 368644 | 7.7% |
| e | 355954 | 7.4% |
| r | 341889 | 7.1% |
| a | 317910 | 6.6% |
| i | 314775 | 6.6% |
| c | 274503 | 5.7% |
| A | 217147 | 4.5% |
| u | 191407 | 4.0% |
| Other values (48) | 1608736 |
Common
| Value | Count | Frequency (%) |
| 660097 | ||
| ; | 389920 | |
| - | 882 | 0.1% |
| ' | 560 | 0.1% |
| . | 316 | < 0.1% |
| & | 196 | < 0.1% |
| [ | 2 | < 0.1% |
| ] | 2 | < 0.1% |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5852731 | |
| None | 379 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 660097 | 11.3% | |
| t | 419182 | 7.2% |
| o | 390984 | 6.7% |
| ; | 389920 | 6.7% |
| n | 368644 | 6.3% |
| e | 355954 | 6.1% |
| r | 341889 | 5.8% |
| a | 317910 | 5.4% |
| i | 314775 | 5.4% |
| c | 274503 | 4.7% |
| Other values (52) | 2018873 |
None
| Value | Count | Frequency (%) |
| á | 110 | |
| í | 98 | |
| ü | 97 | |
| é | 36 | 9.5% |
| ó | 36 | 9.5% |
| ç | 2 | 0.5% |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73143 |
| Missing (%) | 39.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.76341876 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 108995 | |
| europe | 1671 | 1.5% |
| asia | 1008 | 0.9% |
| south_america | 749 | 0.7% |
| oceania | 665 | 0.6% |
| africa | 293 | 0.3% |
| antarctica | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 223435 | |
| R | 220708 | |
| E | 113751 | |
| O | 112080 | |
| I | 111715 | |
| C | 110712 | |
| T | 109754 | |
| H | 109744 | |
| _ | 109744 | |
| M | 109744 | |
| Other values (5) | 115806 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1337449 | |
| Connector Punctuation | 109744 | 7.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 223435 | |
| R | 220708 | |
| E | 113751 | |
| O | 112080 | |
| I | 111715 | |
| C | 110712 | |
| T | 109754 | |
| H | 109744 | |
| M | 109744 | |
| N | 109665 | |
| Other values (4) | 6141 | 0.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 109744 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1337449 | |
| Common | 109744 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 223435 | |
| R | 220708 | |
| E | 113751 | |
| O | 112080 | |
| I | 111715 | |
| C | 110712 | |
| T | 109754 | |
| H | 109744 | |
| M | 109744 | |
| N | 109665 | |
| Other values (4) | 6141 | 0.5% |
Common
| Value | Count | Frequency (%) |
| _ | 109744 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1447193 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 223435 | |
| R | 220708 | |
| E | 113751 | |
| O | 112080 | |
| I | 111715 | |
| C | 110712 | |
| T | 109754 | |
| H | 109744 | |
| _ | 109744 | |
| M | 109744 | |
| Other values (5) | 115806 |
waterBody
Text
Missing 
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 183495 |
| Missing (%) | 98.4% |
| Memory size | 1.4 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 32 |
| Mean length | 22.4996704 |
| Min length | 12 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Atlantic Ocean; Caribbean Sea |
|---|---|
| 2nd row | Atlantic Ocean; Sargasso Sea |
| 3rd row | Atlantic Ocean |
| 4th row | Atlantic Ocean; Caribbean Sea |
| 5th row | Atlantic Ocean; Adriatic Sea |
| Value | Count | Frequency (%) |
| ocean | 3034 | |
| atlantic | 2509 | |
| sea | 1009 | 10.2% |
| caribbean | 673 | 6.8% |
| long | 503 | 5.1% |
| island | 503 | 5.1% |
| sound | 503 | 5.1% |
| pacific | 450 | 4.5% |
| adriatic | 126 | 1.3% |
| sargasso | 123 | 1.2% |
| Other values (15) | 478 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9455 | |
| n | 8029 | |
| 6877 | ||
| c | 6672 | |
| t | 5226 | 7.7% |
| e | 5055 | 7.4% |
| i | 4589 | 6.7% |
| l | 3120 | 4.6% |
| O | 3036 | 4.4% |
| A | 2636 | 3.9% |
| Other values (26) | 13569 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49965 | |
| Uppercase Letter | 9805 | 14.4% |
| Space Separator | 6877 | 10.1% |
| Other Punctuation | 1617 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9455 | |
| n | 8029 | |
| c | 6672 | |
| t | 5226 | |
| e | 5055 | |
| i | 4589 | |
| l | 3120 | 6.2% |
| b | 1347 | 2.7% |
| o | 1339 | 2.7% |
| d | 1289 | 2.6% |
| Other values (11) | 3844 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 3036 | |
| A | 2636 | |
| S | 1637 | |
| C | 675 | 6.9% |
| I | 577 | 5.9% |
| L | 503 | 5.1% |
| P | 451 | 4.6% |
| M | 176 | 1.8% |
| G | 105 | 1.1% |
| R | 6 | 0.1% |
| Other values (3) | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 6877 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1617 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59770 | |
| Common | 8494 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9455 | |
| n | 8029 | |
| c | 6672 | |
| t | 5226 | |
| e | 5055 | |
| i | 4589 | |
| l | 3120 | 5.2% |
| O | 3036 | 5.1% |
| A | 2636 | 4.4% |
| S | 1637 | 2.7% |
| Other values (24) | 10315 |
Common
| Value | Count | Frequency (%) |
| 6877 | ||
| ; | 1617 | 19.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68264 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9455 | |
| n | 8029 | |
| 6877 | ||
| c | 6672 | |
| t | 5226 | 7.7% |
| e | 5055 | 7.4% |
| i | 4589 | 6.7% |
| l | 3120 | 4.6% |
| O | 3036 | 4.4% |
| A | 2636 | 3.9% |
| Other values (26) | 13569 |
countryCode
Text
Missing 
| Distinct | 107 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 72482 |
| Missing (%) | 38.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 16 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | CA |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 98166 | |
| ca | 6370 | 5.6% |
| mx | 1398 | 1.2% |
| cu | 1385 | 1.2% |
| pr | 883 | 0.8% |
| cn | 726 | 0.6% |
| gb | 643 | 0.6% |
| au | 497 | 0.4% |
| bm | 438 | 0.4% |
| fr | 405 | 0.4% |
| Other values (97) | 3136 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 100202 | |
| S | 98426 | |
| C | 8932 | 3.9% |
| A | 7095 | 3.1% |
| M | 2280 | 1.0% |
| R | 1558 | 0.7% |
| B | 1489 | 0.7% |
| X | 1404 | 0.6% |
| P | 1321 | 0.6% |
| N | 982 | 0.4% |
| Other values (16) | 4405 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 228094 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 100202 | |
| S | 98426 | |
| C | 8932 | 3.9% |
| A | 7095 | 3.1% |
| M | 2280 | 1.0% |
| R | 1558 | 0.7% |
| B | 1489 | 0.7% |
| X | 1404 | 0.6% |
| P | 1321 | 0.6% |
| N | 982 | 0.4% |
| Other values (16) | 4405 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 228094 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 100202 | |
| S | 98426 | |
| C | 8932 | 3.9% |
| A | 7095 | 3.1% |
| M | 2280 | 1.0% |
| R | 1558 | 0.7% |
| B | 1489 | 0.7% |
| X | 1404 | 0.6% |
| P | 1321 | 0.6% |
| N | 982 | 0.4% |
| Other values (16) | 4405 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 228094 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 100202 | |
| S | 98426 | |
| C | 8932 | 3.9% |
| A | 7095 | 3.1% |
| M | 2280 | 1.0% |
| R | 1558 | 0.7% |
| B | 1489 | 0.7% |
| X | 1404 | 0.6% |
| P | 1321 | 0.6% |
| N | 982 | 0.4% |
| Other values (16) | 4405 | 1.9% |
stateProvince
Text
Missing 
| Distinct | 228 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 78016 |
| Missing (%) | 41.8% |
| Memory size | 1.4 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 11 |
| Mean length | 10.38558514 |
| Min length | 4 |
Unique
| Unique | 48 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Connecticut |
|---|---|
| 2nd row | Connecticut |
| 3rd row | British Columbia |
| 4th row | Connecticut |
| 5th row | Connecticut |
| Value | Count | Frequency (%) |
| connecticut | 62098 | |
| new | 5448 | 4.4% |
| california | 3651 | 2.9% |
| michigan | 3126 | 2.5% |
| florida | 2732 | 2.2% |
| hampshire | 2664 | 2.1% |
| massachusetts | 2337 | 1.9% |
| maine | 2073 | 1.7% |
| columbia | 2034 | 1.6% |
| british | 1905 | 1.5% |
| Other values (255) | 35967 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 153675 | |
| t | 143885 | |
| c | 135676 | |
| i | 108030 | |
| o | 93850 | |
| e | 90883 | |
| u | 72414 | |
| C | 69792 | 6.2% |
| a | 54867 | 4.9% |
| r | 26005 | 2.3% |
| Other values (48) | 177894 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 987704 | |
| Uppercase Letter | 123543 | 11.0% |
| Space Separator | 15522 | 1.4% |
| Dash Punctuation | 202 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 153675 | |
| t | 143885 | |
| c | 135676 | |
| i | 108030 | |
| o | 93850 | |
| e | 90883 | |
| u | 72414 | |
| a | 54867 | 5.6% |
| r | 26005 | 2.6% |
| s | 23900 | 2.4% |
| Other values (21) | 84519 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 69792 | |
| M | 9865 | 8.0% |
| N | 8790 | 7.1% |
| H | 4662 | 3.8% |
| S | 3193 | 2.6% |
| F | 2734 | 2.2% |
| W | 2621 | 2.1% |
| V | 2372 | 1.9% |
| B | 2286 | 1.9% |
| P | 2093 | 1.7% |
| Other values (15) | 15135 | 12.3% |
Space Separator
| Value | Count | Frequency (%) |
| 15522 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 202 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1111247 | |
| Common | 15724 | 1.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 153675 | |
| t | 143885 | |
| c | 135676 | |
| i | 108030 | |
| o | 93850 | |
| e | 90883 | |
| u | 72414 | |
| C | 69792 | |
| a | 54867 | 4.9% |
| r | 26005 | 2.3% |
| Other values (46) | 162170 |
Common
| Value | Count | Frequency (%) |
| 15522 | ||
| - | 202 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1126602 | |
| None | 369 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 153675 | |
| t | 143885 | |
| c | 135676 | |
| i | 108030 | |
| o | 93850 | |
| e | 90883 | |
| u | 72414 | |
| C | 69792 | 6.2% |
| a | 54867 | 4.9% |
| r | 26005 | 2.3% |
| Other values (43) | 177525 |
None
| Value | Count | Frequency (%) |
| á | 107 | |
| í | 98 | |
| ü | 97 | |
| ó | 34 | 9.2% |
| é | 33 | 8.9% |
county
Text
Missing 
| Distinct | 881 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 98586 |
| Missing (%) | 52.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 31 |
| Mean length | 15.47382964 |
| Min length | 4 |
Unique
| Unique | 234 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | New London County |
|---|---|
| 2nd row | New Haven County |
| 3rd row | Litchfield County |
| 4th row | Hartford County |
| 5th row | Litchfield County |
| Value | Count | Frequency (%) |
| county | 86823 | |
| new | 27711 | 13.4% |
| haven | 21492 | 10.4% |
| hartford | 10602 | 5.1% |
| litchfield | 8892 | 4.3% |
| fairfield | 6414 | 3.1% |
| london | 6205 | 3.0% |
| middlesex | 4458 | 2.2% |
| windham | 2098 | 1.0% |
| tolland | 1927 | 0.9% |
| Other values (919) | 29493 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 142574 | 10.5% |
| o | 128105 | 9.4% |
| 118172 | 8.7% | |
| t | 116308 | 8.5% |
| u | 93310 | 6.9% |
| C | 90545 | 6.7% |
| e | 90254 | 6.6% |
| y | 88473 | 6.5% |
| a | 63460 | 4.7% |
| d | 48213 | 3.5% |
| Other values (49) | 381401 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1035147 | |
| Uppercase Letter | 206666 | 15.2% |
| Space Separator | 118172 | 8.7% |
| Dash Punctuation | 444 | < 0.1% |
| Other Punctuation | 384 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 142574 | |
| o | 128105 | |
| t | 116308 | |
| u | 93310 | |
| e | 90254 | |
| y | 88473 | |
| a | 63460 | 6.1% |
| d | 48213 | 4.7% |
| i | 47877 | 4.6% |
| r | 39508 | 3.8% |
| Other values (17) | 177065 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 90545 | |
| H | 33277 | 16.1% |
| N | 28182 | 13.6% |
| L | 16295 | 7.9% |
| M | 7866 | 3.8% |
| F | 7523 | 3.6% |
| S | 4016 | 1.9% |
| W | 3238 | 1.6% |
| T | 2375 | 1.1% |
| B | 2345 | 1.1% |
| Other values (16) | 11004 | 5.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 223 | |
| ' | 161 |
Space Separator
| Value | Count | Frequency (%) |
| 118172 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 444 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1241813 | |
| Common | 119002 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 142574 | |
| o | 128105 | 10.3% |
| t | 116308 | 9.4% |
| u | 93310 | 7.5% |
| C | 90545 | 7.3% |
| e | 90254 | 7.3% |
| y | 88473 | 7.1% |
| a | 63460 | 5.1% |
| d | 48213 | 3.9% |
| i | 47877 | 3.9% |
| Other values (43) | 332694 |
Common
| Value | Count | Frequency (%) |
| 118172 | ||
| - | 444 | 0.4% |
| . | 223 | 0.2% |
| ' | 161 | 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1360813 | |
| None | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 142574 | 10.5% |
| o | 128105 | 9.4% |
| 118172 | 8.7% | |
| t | 116308 | 8.5% |
| u | 93310 | 6.9% |
| C | 90545 | 6.7% |
| e | 90254 | 6.6% |
| y | 88473 | 6.5% |
| a | 63460 | 4.7% |
| d | 48213 | 3.5% |
| Other values (48) | 381399 |
None
| Value | Count | Frequency (%) |
| ó | 2 |
municipality
Text
Missing 
| Distinct | 2118 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 110052 |
| Missing (%) | 59.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 30 |
| Mean length | 8.966486656 |
| Min length | 3 |
Unique
| Unique | 841 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | Salem |
|---|---|
| 2nd row | New Haven |
| 3rd row | Washington |
| 4th row | Southington |
| 5th row | Cornwall |
| Value | Count | Frequency (%) |
| haven | 8458 | 8.7% |
| new | 8105 | 8.3% |
| southington | 2857 | 2.9% |
| north | 2691 | 2.8% |
| east | 2584 | 2.7% |
| guilford | 2301 | 2.4% |
| salisbury | 1956 | 2.0% |
| lyme | 1877 | 1.9% |
| branford | 1824 | 1.9% |
| hartford | 1809 | 1.9% |
| Other values (2013) | 62977 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 53847 | 7.9% |
| e | 53791 | 7.8% |
| n | 53538 | 7.8% |
| r | 52706 | 7.7% |
| a | 51213 | 7.5% |
| t | 42856 | 6.2% |
| i | 37612 | 5.5% |
| l | 34531 | 5.0% |
| d | 28780 | 4.2% |
| 20962 | 3.1% | |
| Other values (52) | 255894 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 566373 | |
| Uppercase Letter | 97469 | 14.2% |
| Space Separator | 20962 | 3.1% |
| Other Punctuation | 688 | 0.1% |
| Dash Punctuation | 236 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 53847 | 9.5% |
| e | 53791 | 9.5% |
| n | 53538 | 9.5% |
| r | 52706 | 9.3% |
| a | 51213 | 9.0% |
| t | 42856 | 7.6% |
| i | 37612 | 6.6% |
| l | 34531 | 6.1% |
| d | 28780 | 5.1% |
| s | 19429 | 3.4% |
| Other values (19) | 138070 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 13453 | |
| H | 13413 | |
| N | 12984 | |
| W | 9137 | |
| B | 6850 | 7.0% |
| G | 5994 | 6.1% |
| C | 4838 | 5.0% |
| M | 4772 | 4.9% |
| L | 4537 | 4.7% |
| E | 3457 | 3.5% |
| Other values (16) | 18034 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 399 | |
| & | 196 | |
| . | 93 | 13.5% |
Space Separator
| Value | Count | Frequency (%) |
| 20962 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 236 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 663842 | |
| Common | 21888 | 3.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 53847 | 8.1% |
| e | 53791 | 8.1% |
| n | 53538 | 8.1% |
| r | 52706 | 7.9% |
| a | 51213 | 7.7% |
| t | 42856 | 6.5% |
| i | 37612 | 5.7% |
| l | 34531 | 5.2% |
| d | 28780 | 4.3% |
| s | 19429 | 2.9% |
| Other values (45) | 235539 |
Common
| Value | Count | Frequency (%) |
| 20962 | ||
| ' | 399 | 1.8% |
| - | 236 | 1.1% |
| & | 196 | 0.9% |
| . | 93 | 0.4% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 685722 | |
| None | 8 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 53847 | 7.9% |
| e | 53791 | 7.8% |
| n | 53538 | 7.8% |
| r | 52706 | 7.7% |
| a | 51213 | 7.5% |
| t | 42856 | 6.2% |
| i | 37612 | 5.5% |
| l | 34531 | 5.0% |
| d | 28780 | 4.2% |
| 20962 | 3.1% | |
| Other values (49) | 255886 |
None
| Value | Count | Frequency (%) |
| á | 3 | |
| é | 3 | |
| ç | 2 |
locality
Text
Missing 
| Distinct | 21413 |
|---|---|
| Distinct (%) | 35.0% |
| Missing | 125307 |
| Missing (%) | 67.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 351 |
|---|---|
| Median length | 190 |
| Mean length | 26.85044592 |
| Min length | 3 |
Unique
| Unique | 14163 ? |
|---|---|
| Unique (%) | 23.1% |
Sample
| 1st row | near Scotch Creek, Shushwap Lake |
|---|---|
| 2nd row | South Shuttle Street |
| 3rd row | Calumet Island, Timbalier Bay |
| 4th row | Rio Blanco |
| 5th row | Oak Hill |
| Value | Count | Frequency (%) |
| of | 13616 | 5.1% |
| near | 6444 | 2.4% |
| island | 5988 | 2.3% |
| river | 4368 | 1.6% |
| lake | 4069 | 1.5% |
| and | 3691 | 1.4% |
| road | 3087 | 1.2% |
| yale | 3008 | 1.1% |
| mountains | 2923 | 1.1% |
| west | 2905 | 1.1% |
| Other values (11704) | 214841 |
Most occurring characters
| Value | Count | Frequency (%) |
| 203718 | 12.4% | |
| a | 141357 | 8.6% |
| e | 131463 | 8.0% |
| o | 116914 | 7.1% |
| n | 109604 | 6.7% |
| r | 89713 | 5.5% |
| t | 83094 | 5.1% |
| i | 75103 | 4.6% |
| l | 66840 | 4.1% |
| s | 66454 | 4.0% |
| Other values (84) | 559578 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1197147 | |
| Space Separator | 203718 | 12.4% |
| Uppercase Letter | 186670 | 11.4% |
| Other Punctuation | 33882 | 2.1% |
| Decimal Number | 13304 | 0.8% |
| Close Punctuation | 3613 | 0.2% |
| Open Punctuation | 3582 | 0.2% |
| Dash Punctuation | 1450 | 0.1% |
| Other Symbol | 423 | < 0.1% |
| Math Symbol | 49 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 141357 | |
| e | 131463 | |
| o | 116914 | |
| n | 109604 | |
| r | 89713 | 7.5% |
| t | 83094 | 6.9% |
| i | 75103 | 6.3% |
| l | 66840 | 5.6% |
| s | 66454 | 5.6% |
| d | 50253 | 4.2% |
| Other values (24) | 266352 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 17688 | 9.5% |
| S | 17239 | 9.2% |
| R | 16853 | 9.0% |
| C | 14862 | 8.0% |
| P | 14723 | 7.9% |
| B | 14281 | 7.7% |
| L | 12462 | 6.7% |
| H | 8806 | 4.7% |
| N | 8375 | 4.5% |
| I | 8098 | 4.3% |
| Other values (17) | 53283 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 22303 | |
| . | 6163 | 18.2% |
| ' | 3859 | 11.4% |
| / | 345 | 1.0% |
| " | 313 | 0.9% |
| ; | 267 | 0.8% |
| : | 241 | 0.7% |
| ? | 164 | 0.5% |
| & | 134 | 0.4% |
| # | 92 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2497 | |
| 2 | 1786 | |
| 0 | 1562 | |
| 3 | 1451 | |
| 5 | 1340 | |
| 4 | 1323 | |
| 7 | 900 | 6.8% |
| 9 | 872 | 6.6% |
| 6 | 818 | 6.1% |
| 8 | 755 | 5.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 3215 | |
| ) | 397 | 11.0% |
| } | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 3212 | |
| ( | 369 | 10.3% |
| { | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 42 | |
| + | 6 | 12.2% |
| > | 1 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| 203718 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1450 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 423 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1383817 | |
| Common | 260021 | 15.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 141357 | 10.2% |
| e | 131463 | 9.5% |
| o | 116914 | 8.4% |
| n | 109604 | 7.9% |
| r | 89713 | 6.5% |
| t | 83094 | 6.0% |
| i | 75103 | 5.4% |
| l | 66840 | 4.8% |
| s | 66454 | 4.8% |
| d | 50253 | 3.6% |
| Other values (51) | 453022 |
Common
| Value | Count | Frequency (%) |
| 203718 | ||
| , | 22303 | 8.6% |
| . | 6163 | 2.4% |
| ' | 3859 | 1.5% |
| ] | 3215 | 1.2% |
| [ | 3212 | 1.2% |
| 1 | 2497 | 1.0% |
| 2 | 1786 | 0.7% |
| 0 | 1562 | 0.6% |
| 3 | 1451 | 0.6% |
| Other values (23) | 10255 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1643370 | |
| None | 468 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 203718 | 12.4% | |
| a | 141357 | 8.6% |
| e | 131463 | 8.0% |
| o | 116914 | 7.1% |
| n | 109604 | 6.7% |
| r | 89713 | 5.5% |
| t | 83094 | 5.1% |
| i | 75103 | 4.6% |
| l | 66840 | 4.1% |
| s | 66454 | 4.0% |
| Other values (74) | 559110 |
None
| Value | Count | Frequency (%) |
| ° | 423 | |
| é | 14 | 3.0% |
| á | 9 | 1.9% |
| í | 6 | 1.3% |
| à | 6 | 1.3% |
| Î | 4 | 0.9% |
| ú | 2 | 0.4% |
| ñ | 2 | 0.4% |
| ã | 1 | 0.2% |
| ä | 1 | 0.2% |
Missing 
| Distinct | 884 |
|---|---|
| Distinct (%) | 11.6% |
| Missing | 178933 |
| Missing (%) | 95.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 5.866508689 |
| Min length | 3 |
Unique
| Unique | 369 ? |
|---|---|
| Unique (%) | 4.9% |
Sample
| 1st row | 564 m |
|---|---|
| 2nd row | 1450-1550 m |
| 3rd row | 1012 m |
| 4th row | 137 m |
| 5th row | 1463 m |
| Value | Count | Frequency (%) |
| m | 7482 | |
| 1524 | 267 | 1.8% |
| 305 | 236 | 1.6% |
| 1219 | 190 | 1.3% |
| 1829 | 179 | 1.2% |
| 366 | 170 | 1.1% |
| 914 | 167 | 1.1% |
| 610 | 162 | 1.1% |
| 2743 | 153 | 1.0% |
| 244 | 150 | 1.0% |
| Other values (875) | 6036 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7596 | ||
| m | 7482 | |
| 0 | 5082 | |
| 1 | 4893 | |
| 2 | 3978 | |
| 3 | 2614 | 5.9% |
| 5 | 2551 | 5.7% |
| 4 | 2434 | 5.5% |
| 6 | 1983 | 4.4% |
| 8 | 1718 | 3.9% |
| Other values (5) | 4231 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28520 | |
| Lowercase Letter | 7710 | 17.3% |
| Space Separator | 7596 | 17.0% |
| Dash Punctuation | 736 | 1.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5082 | |
| 1 | 4893 | |
| 2 | 3978 | |
| 3 | 2614 | |
| 5 | 2551 | |
| 4 | 2434 | |
| 6 | 1983 | 7.0% |
| 8 | 1718 | 6.0% |
| 7 | 1718 | 6.0% |
| 9 | 1549 | 5.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 7482 | |
| f | 114 | 1.5% |
| t | 114 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 7596 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 736 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 36852 | |
| Latin | 7710 | 17.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7596 | ||
| 0 | 5082 | |
| 1 | 4893 | |
| 2 | 3978 | |
| 3 | 2614 | 7.1% |
| 5 | 2551 | 6.9% |
| 4 | 2434 | 6.6% |
| 6 | 1983 | 5.4% |
| 8 | 1718 | 4.7% |
| 7 | 1718 | 4.7% |
| Other values (2) | 2285 | 6.2% |
Latin
| Value | Count | Frequency (%) |
| m | 7482 | |
| f | 114 | 1.5% |
| t | 114 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44562 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7596 | ||
| m | 7482 | |
| 0 | 5082 | |
| 1 | 4893 | |
| 2 | 3978 | |
| 3 | 2614 | 5.9% |
| 5 | 2551 | 5.7% |
| 4 | 2434 | 5.5% |
| 6 | 1983 | 4.4% |
| 8 | 1718 | 3.9% |
| Other values (5) | 4231 |
decimalLatitude
Text
Missing 
| Distinct | 8319 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 82100 |
| Missing (%) | 44.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.598693849 |
| Min length | 3 |
Unique
| Unique | 4029 ? |
|---|---|
| Unique (%) | 3.9% |
Sample
| 1st row | 41.4854 |
|---|---|
| 2nd row | 41.407 |
| 3rd row | 51.0 |
| 4th row | 41.6523 |
| 5th row | 41.605 |
| Value | Count | Frequency (%) |
| 41.407 | 2004 | 1.9% |
| 41.305111 | 1951 | 1.9% |
| 41.3114 | 1870 | 1.8% |
| 41.605 | 1661 | 1.6% |
| 41.5583 | 1312 | 1.3% |
| 41.6049 | 1164 | 1.1% |
| 41.986 | 1069 | 1.0% |
| 46.166667 | 1017 | 1.0% |
| 41.6153 | 994 | 1.0% |
| 41.7413 | 947 | 0.9% |
| Other values (8306) | 90440 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 139283 | |
| 1 | 119799 | |
| . | 104429 | |
| 3 | 70791 | |
| 6 | 63059 | |
| 9 | 54886 | 6.9% |
| 7 | 53117 | 6.7% |
| 5 | 52064 | 6.6% |
| 2 | 50360 | 6.3% |
| 8 | 44495 | 5.6% |
| Other values (2) | 41241 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 688290 | |
| Other Punctuation | 104429 | 13.2% |
| Dash Punctuation | 805 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 139283 | |
| 1 | 119799 | |
| 3 | 70791 | |
| 6 | 63059 | |
| 9 | 54886 | 8.0% |
| 7 | 53117 | 7.7% |
| 5 | 52064 | 7.6% |
| 2 | 50360 | 7.3% |
| 8 | 44495 | 6.5% |
| 0 | 40436 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 104429 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 805 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 793524 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 139283 | |
| 1 | 119799 | |
| . | 104429 | |
| 3 | 70791 | |
| 6 | 63059 | |
| 9 | 54886 | 6.9% |
| 7 | 53117 | 6.7% |
| 5 | 52064 | 6.6% |
| 2 | 50360 | 6.3% |
| 8 | 44495 | 5.6% |
| Other values (2) | 41241 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 793524 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 139283 | |
| 1 | 119799 | |
| . | 104429 | |
| 3 | 70791 | |
| 6 | 63059 | |
| 9 | 54886 | 6.9% |
| 7 | 53117 | 6.7% |
| 5 | 52064 | 6.6% |
| 2 | 50360 | 6.3% |
| 8 | 44495 | 5.6% |
| Other values (2) | 41241 | 5.2% |
decimalLongitude
Text
Missing 
| Distinct | 8315 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 82100 |
| Missing (%) | 44.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 8.608298461 |
| Min length | 3 |
Unique
| Unique | 4015 ? |
|---|---|
| Unique (%) | 3.8% |
Sample
| 1st row | -72.2664 |
|---|---|
| 2nd row | -72.9316 |
| 3rd row | -119.0 |
| 4th row | -73.3145 |
| 5th row | -72.88 |
| Value | Count | Frequency (%) |
| 72.88 | 2825 | 2.7% |
| 72.9316 | 1988 | 1.9% |
| 72.920823 | 1951 | 1.9% |
| 72.9247 | 1870 | 1.8% |
| 73.1931 | 1368 | 1.3% |
| 73.036 | 1211 | 1.2% |
| 72.8575 | 1086 | 1.0% |
| 73.4257 | 1069 | 1.0% |
| 60.75 | 1048 | 1.0% |
| 72.4831 | 902 | 0.9% |
| Other values (8302) | 89111 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 128553 | |
| . | 104429 | |
| - | 102523 | |
| 2 | 98658 | |
| 3 | 84490 | |
| 1 | 72475 | |
| 8 | 63406 | |
| 6 | 53825 | |
| 9 | 52391 | |
| 5 | 47500 | 5.3% |
| Other values (2) | 90706 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 692004 | |
| Other Punctuation | 104429 | 11.6% |
| Dash Punctuation | 102523 | 11.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 128553 | |
| 2 | 98658 | |
| 3 | 84490 | |
| 1 | 72475 | |
| 8 | 63406 | |
| 6 | 53825 | |
| 9 | 52391 | |
| 5 | 47500 | 6.9% |
| 4 | 46836 | 6.8% |
| 0 | 43870 | 6.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 104429 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 102523 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 898956 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 128553 | |
| . | 104429 | |
| - | 102523 | |
| 2 | 98658 | |
| 3 | 84490 | |
| 1 | 72475 | |
| 8 | 63406 | |
| 6 | 53825 | |
| 9 | 52391 | |
| 5 | 47500 | 5.3% |
| Other values (2) | 90706 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 898956 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 128553 | |
| . | 104429 | |
| - | 102523 | |
| 2 | 98658 | |
| 3 | 84490 | |
| 1 | 72475 | |
| 8 | 63406 | |
| 6 | 53825 | |
| 9 | 52391 | |
| 5 | 47500 | 5.3% |
| Other values (2) | 90706 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 5428 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 82138 |
| Missing (%) | 44.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.201770268 |
| Min length | 3 |
Unique
| Unique | 2086 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | 7093.0 |
|---|---|
| 2nd row | 7710.0 |
| 3rd row | 1189.0 |
| 4th row | 7762.0 |
| 5th row | 7725.0 |
| Value | Count | Frequency (%) |
| 7725.0 | 2825 | 2.7% |
| 1851.0 | 2328 | 2.2% |
| 7710.0 | 1992 | 1.9% |
| 6384.0 | 1951 | 1.9% |
| 7484.0 | 1870 | 1.8% |
| 9878.0 | 1817 | 1.7% |
| 5062.0 | 1804 | 1.7% |
| 11151.0 | 1368 | 1.3% |
| 6630.0 | 1312 | 1.3% |
| 7184.0 | 1083 | 1.0% |
| Other values (5418) | 86041 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 152670 | |
| . | 104391 | |
| 1 | 58864 | 9.1% |
| 7 | 50281 | 7.8% |
| 5 | 47230 | 7.3% |
| 8 | 43792 | 6.8% |
| 6 | 41665 | 6.4% |
| 4 | 40758 | 6.3% |
| 3 | 37978 | 5.9% |
| 2 | 35411 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 543018 | |
| Other Punctuation | 104391 | 16.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 152670 | |
| 1 | 58864 | 10.8% |
| 7 | 50281 | 9.3% |
| 5 | 47230 | 8.7% |
| 8 | 43792 | 8.1% |
| 6 | 41665 | 7.7% |
| 4 | 40758 | 7.5% |
| 3 | 37978 | 7.0% |
| 2 | 35411 | 6.5% |
| 9 | 34369 | 6.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 104391 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 647409 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 152670 | |
| . | 104391 | |
| 1 | 58864 | 9.1% |
| 7 | 50281 | 7.8% |
| 5 | 47230 | 7.3% |
| 8 | 43792 | 6.8% |
| 6 | 41665 | 6.4% |
| 4 | 40758 | 6.3% |
| 3 | 37978 | 5.9% |
| 2 | 35411 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 647409 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 152670 | |
| . | 104391 | |
| 1 | 58864 | 9.1% |
| 7 | 50281 | 7.8% |
| 5 | 47230 | 7.3% |
| 8 | 43792 | 6.8% |
| 6 | 41665 | 6.4% |
| 4 | 40758 | 6.3% |
| 3 | 37978 | 5.9% |
| 2 | 35411 | 5.5% |
georeferencedBy
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 182211 |
| Missing (%) | 97.7% |
| Memory size | 1.4 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 16.97522001 |
| Min length | 13 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Angus J. Mossman |
|---|---|
| 2nd row | Angus J. Mossman |
| 3rd row | Angus J. Mossman |
| 4th row | Angus J. Mossman |
| 5th row | Angus J. Mossman |
| Value | Count | Frequency (%) |
| angus | 2204 | |
| j | 2204 | |
| mossman | 2204 | |
| patrick | 2110 | |
| w | 2110 | |
| sweeney | 2110 | |
| lynn | 1 | < 0.1% |
| a | 1 | < 0.1% |
| jones | 1 | < 0.1% |
| jesse | 1 | < 0.1% |
| Other values (6) | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 8634 | 11.8% | |
| s | 6616 | 9.0% |
| n | 6522 | 8.9% |
| e | 6336 | 8.6% |
| a | 4318 | 5.9% |
| . | 4316 | 5.9% |
| J | 2206 | 3.0% |
| A | 2205 | 3.0% |
| o | 2205 | 3.0% |
| g | 2204 | 3.0% |
| Other values (21) | 27737 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47397 | |
| Uppercase Letter | 12952 | 17.7% |
| Space Separator | 8634 | 11.8% |
| Other Punctuation | 4316 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 6616 | |
| n | 6522 | |
| e | 6336 | |
| a | 4318 | |
| o | 2205 | 4.7% |
| g | 2204 | 4.7% |
| u | 2204 | 4.7% |
| m | 2204 | 4.7% |
| r | 2114 | 4.5% |
| w | 2112 | 4.5% |
| Other values (8) | 10562 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 2206 | |
| A | 2205 | |
| M | 2204 | |
| W | 2110 | |
| S | 2110 | |
| P | 2110 | |
| L | 2 | < 0.1% |
| E | 2 | < 0.1% |
| N | 1 | < 0.1% |
| F | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 8634 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4316 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 60349 | |
| Common | 12950 | 17.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 6616 | 11.0% |
| n | 6522 | 10.8% |
| e | 6336 | 10.5% |
| a | 4318 | 7.2% |
| J | 2206 | 3.7% |
| A | 2205 | 3.7% |
| o | 2205 | 3.7% |
| g | 2204 | 3.7% |
| u | 2204 | 3.7% |
| M | 2204 | 3.7% |
| Other values (19) | 23329 |
Common
| Value | Count | Frequency (%) |
| 8634 | ||
| . | 4316 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 73299 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8634 | 11.8% | |
| s | 6616 | 9.0% |
| n | 6522 | 8.9% |
| e | 6336 | 8.6% |
| a | 4318 | 5.9% |
| . | 4316 | 5.9% |
| J | 2206 | 3.0% |
| A | 2205 | 3.0% |
| o | 2205 | 3.0% |
| g | 2204 | 3.0% |
| Other values (21) | 27737 |
Missing 
| Distinct | 43 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 174887 |
| Missing (%) | 93.8% |
| Memory size | 1.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 8.869266449 |
| Min length | 4 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2016-11-04 |
|---|---|
| 2nd row | 2023-06-13 |
| 3rd row | 2015 |
| 4th row | 2016-11-04 |
| 5th row | 2023-08-24 |
| Value | Count | Frequency (%) |
| 2015 | 2193 | |
| 2016-11-04 | 1996 | |
| 2023-08-24 | 1867 | |
| 2016-06-23 | 1595 | |
| 2023-06-13 | 1462 | |
| 2024-05-18 | 1141 | |
| 2024-01-17 | 616 | 5.3% |
| 2023-08-13 | 395 | 3.4% |
| 2016-10-31 | 121 | 1.0% |
| 2016-10-28 | 51 | 0.4% |
| Other values (33) | 205 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 21119 | |
| 2 | 20850 | |
| - | 18896 | |
| 1 | 14751 | |
| 3 | 7402 | 7.2% |
| 6 | 6972 | 6.8% |
| 4 | 5713 | 5.5% |
| 8 | 3515 | 3.4% |
| 5 | 3345 | 3.2% |
| 7 | 673 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 84360 | |
| Dash Punctuation | 18896 | 18.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 21119 | |
| 2 | 20850 | |
| 1 | 14751 | |
| 3 | 7402 | 8.8% |
| 6 | 6972 | 8.3% |
| 4 | 5713 | 6.8% |
| 8 | 3515 | 4.2% |
| 5 | 3345 | 4.0% |
| 7 | 673 | 0.8% |
| 9 | 20 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18896 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 103256 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 21119 | |
| 2 | 20850 | |
| - | 18896 | |
| 1 | 14751 | |
| 3 | 7402 | 7.2% |
| 6 | 6972 | 6.8% |
| 4 | 5713 | 5.5% |
| 8 | 3515 | 3.4% |
| 5 | 3345 | 3.2% |
| 7 | 673 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 103256 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 21119 | |
| 2 | 20850 | |
| - | 18896 | |
| 1 | 14751 | |
| 3 | 7402 | 7.2% |
| 6 | 6972 | 6.8% |
| 4 | 5713 | 5.5% |
| 8 | 3515 | 3.4% |
| 5 | 3345 | 3.2% |
| 7 | 673 | 0.7% |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 82331 |
| Missing (%) | 44.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 16.43626557 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | physical resource |
|---|---|
| 2nd row | digital resource |
| 3rd row | digital resource |
| 4th row | physical resource |
| 5th row | physical resource |
| Value | Count | Frequency (%) |
| resource | 102675 | |
| physical | 53073 | |
| digital | 49602 | |
| unspecified | 1523 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 208396 | |
| r | 205350 | |
| s | 157271 | |
| c | 157271 | |
| i | 155323 | |
| u | 104198 | 6.1% |
| a | 102675 | 6.0% |
| l | 102675 | 6.0% |
| 102675 | 6.0% | |
| o | 102675 | 6.0% |
| Other values (8) | 314117 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1609951 | |
| Space Separator | 102675 | 6.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 208396 | |
| r | 205350 | |
| s | 157271 | |
| c | 157271 | |
| i | 155323 | |
| u | 104198 | |
| a | 102675 | 6.4% |
| l | 102675 | 6.4% |
| o | 102675 | 6.4% |
| p | 54596 | 3.4% |
| Other values (7) | 259521 |
Space Separator
| Value | Count | Frequency (%) |
| 102675 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1609951 | |
| Common | 102675 | 6.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 208396 | |
| r | 205350 | |
| s | 157271 | |
| c | 157271 | |
| i | 155323 | |
| u | 104198 | |
| a | 102675 | 6.4% |
| l | 102675 | 6.4% |
| o | 102675 | 6.4% |
| p | 54596 | 3.4% |
| Other values (7) | 259521 |
Common
| Value | Count | Frequency (%) |
| 102675 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1712626 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 208396 | |
| r | 205350 | |
| s | 157271 | |
| c | 157271 | |
| i | 155323 | |
| u | 104198 | 6.1% |
| a | 102675 | 6.0% |
| l | 102675 | 6.0% |
| 102675 | 6.0% | |
| o | 102675 | 6.0% |
| Other values (8) | 314117 |
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 83888 |
| Missing (%) | 45.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 15 |
| Mean length | 14.92256506 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | topographic map |
|---|---|
| 2nd row | GEOLocate |
| 3rd row | GEOLocate |
| 4th row | topographic map |
| 5th row | topographic map |
| Value | Count | Frequency (%) |
| topographic | 53027 | |
| map | 53027 | |
| geolocate | 31271 | |
| usa | 13341 | 6.3% |
| state | 13210 | 6.3% |
| digital | 13210 | 6.3% |
| data | 13210 | 6.3% |
| resource | 13210 | 6.3% |
| vertnet | 1811 | 0.9% |
| unspecified | 1524 | 0.7% |
| Other values (12) | 3467 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 190363 | |
| p | 160611 | |
| o | 150929 | 9.9% |
| t | 142161 | 9.3% |
| 107667 | 7.0% | |
| c | 99042 | 6.5% |
| i | 83723 | 5.5% |
| e | 77911 | 5.1% |
| r | 68240 | 4.5% |
| g | 66434 | 4.3% |
| Other values (30) | 384586 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1196078 | |
| Uppercase Letter | 227398 | 14.8% |
| Space Separator | 107667 | 7.0% |
| Decimal Number | 524 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 190363 | |
| p | 160611 | |
| o | 150929 | |
| t | 142161 | |
| c | 99042 | |
| i | 83723 | |
| e | 77911 | |
| r | 68240 | 5.7% |
| g | 66434 | 5.6% |
| h | 53210 | 4.4% |
| Other values (8) | 103454 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 32808 | |
| E | 31836 | |
| O | 31271 | |
| L | 31271 | |
| S | 27769 | |
| D | 26420 | |
| R | 13341 | |
| U | 13341 | |
| A | 13341 | |
| V | 2062 | 0.9% |
| Other values (7) | 3938 | 1.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 131 | |
| 4 | 131 | |
| 0 | 131 | |
| 2 | 131 |
Space Separator
| Value | Count | Frequency (%) |
| 107667 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1423476 | |
| Common | 108191 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 190363 | |
| p | 160611 | |
| o | 150929 | |
| t | 142161 | |
| c | 99042 | 7.0% |
| i | 83723 | 5.9% |
| e | 77911 | 5.5% |
| r | 68240 | 4.8% |
| g | 66434 | 4.7% |
| h | 53210 | 3.7% |
| Other values (25) | 330852 |
Common
| Value | Count | Frequency (%) |
| 107667 | ||
| 1 | 131 | 0.1% |
| 4 | 131 | 0.1% |
| 0 | 131 | 0.1% |
| 2 | 131 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1531667 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 190363 | |
| p | 160611 | |
| o | 150929 | 9.9% |
| t | 142161 | 9.3% |
| 107667 | 7.0% | |
| c | 99042 | 6.5% |
| i | 83723 | 5.5% |
| e | 77911 | 5.1% |
| r | 68240 | 4.5% |
| g | 66434 | 4.3% |
| Other values (30) | 384586 |
Missing 
| Distinct | 6514 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 85474 |
| Missing (%) | 45.8% |
| Memory size | 1.4 MiB |
Length
| Max length | 465 |
|---|---|
| Median length | 15 |
| Mean length | 63.16058582 |
| Min length | 2 |
Unique
| Unique | 3825 ? |
|---|---|
| Unique (%) | 3.8% |
Sample
| 1st row | from CT DEP Map |
|---|---|
| 2nd row | ex Argus |
| 3rd row | jlsanesdoc (2015-07-14 13:16:00); Geolocated to Shuswap Lake |
| 4th row | from CT DEP Map |
| 5th row | from CT DEP Map |
| Value | Count | Frequency (%) |
| the | 81840 | 8.6% |
| from | 60217 | 6.3% |
| ct | 56024 | 5.9% |
| map | 53164 | 5.6% |
| dep | 53055 | 5.6% |
| of | 33115 | 3.5% |
| centroid | 28035 | 3.0% |
| polygon | 28033 | 3.0% |
| uncertainty | 18145 | 1.9% |
| database | 17580 | 1.9% |
| Other values (8113) | 520344 |
Most occurring characters
| Value | Count | Frequency (%) |
| 848525 | 13.3% | |
| e | 492230 | 7.7% |
| o | 402931 | 6.3% |
| t | 360014 | 5.6% |
| a | 343662 | 5.4% |
| r | 306928 | 4.8% |
| n | 285528 | 4.5% |
| i | 227311 | 3.6% |
| s | 197953 | 3.1% |
| d | 177049 | 2.8% |
| Other values (75) | 2740562 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4013075 | |
| Space Separator | 848525 | 13.3% |
| Uppercase Letter | 636507 | 10.0% |
| Decimal Number | 495160 | 7.8% |
| Other Punctuation | 236193 | 3.7% |
| Dash Punctuation | 59535 | 0.9% |
| Open Punctuation | 42944 | 0.7% |
| Close Punctuation | 42942 | 0.7% |
| Connector Punctuation | 7578 | 0.1% |
| Math Symbol | 233 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 492230 | |
| o | 402931 | 10.0% |
| t | 360014 | 9.0% |
| a | 343662 | 8.6% |
| r | 306928 | 7.6% |
| n | 285528 | 7.1% |
| i | 227311 | 5.7% |
| s | 197953 | 4.9% |
| d | 177049 | 4.4% |
| l | 138359 | 3.4% |
| Other values (17) | 1081110 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 87444 | |
| T | 81216 | |
| D | 77234 | |
| M | 76938 | |
| P | 67525 | |
| E | 62274 | |
| A | 34574 | 5.4% |
| G | 34244 | 5.4% |
| N | 23701 | 3.7% |
| S | 14621 | 2.3% |
| Other values (16) | 76736 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 81586 | |
| : | 52775 | |
| , | 50428 | |
| / | 28633 | 12.1% |
| ; | 21546 | 9.1% |
| & | 772 | 0.3% |
| ' | 213 | 0.1% |
| " | 120 | 0.1% |
| ? | 88 | < 0.1% |
| % | 27 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 116785 | |
| 2 | 103300 | |
| 1 | 87747 | |
| 5 | 44518 | 9.0% |
| 4 | 35937 | 7.3% |
| 6 | 29500 | 6.0% |
| 3 | 24561 | 5.0% |
| 8 | 24163 | 4.9% |
| 7 | 14864 | 3.0% |
| 9 | 13785 | 2.8% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 177 | |
| + | 55 | 23.6% |
| ~ | 1 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 42943 | |
| [ | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 42941 | |
| ] | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 848525 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 59535 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7578 |
Other Symbol
| Value | Count | Frequency (%) |
| ¦ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4649582 | |
| Common | 1733111 | 27.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 492230 | 10.6% |
| o | 402931 | 8.7% |
| t | 360014 | 7.7% |
| a | 343662 | 7.4% |
| r | 306928 | 6.6% |
| n | 285528 | 6.1% |
| i | 227311 | 4.9% |
| s | 197953 | 4.3% |
| d | 177049 | 3.8% |
| l | 138359 | 3.0% |
| Other values (43) | 1717617 |
Common
| Value | Count | Frequency (%) |
| 848525 | ||
| 0 | 116785 | 6.7% |
| 2 | 103300 | 6.0% |
| 1 | 87747 | 5.1% |
| . | 81586 | 4.7% |
| - | 59535 | 3.4% |
| : | 52775 | 3.0% |
| , | 50428 | 2.9% |
| 5 | 44518 | 2.6% |
| ( | 42943 | 2.5% |
| Other values (22) | 244969 | 14.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6382686 | |
| None | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 848525 | 13.3% | |
| e | 492230 | 7.7% |
| o | 402931 | 6.3% |
| t | 360014 | 5.6% |
| a | 343662 | 5.4% |
| r | 306928 | 4.8% |
| n | 285528 | 4.5% |
| i | 227311 | 3.6% |
| s | 197953 | 3.1% |
| d | 177049 | 2.8% |
| Other values (73) | 2740555 |
None
| Value | Count | Frequency (%) |
| ÿ | 6 | |
| ¦ | 1 | 14.3% |
typeStatus
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 182608 |
| Missing (%) | 97.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 7 |
| Mean length | 7.223922469 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ISOTYPE |
|---|---|
| 2nd row | ISOSYNTYPE |
| 3rd row | ISOTYPE |
| 4th row | ISOTYPE |
| 5th row | ISOTYPE |
| Value | Count | Frequency (%) |
| isotype | 2414 | |
| syntype | 851 | 21.7% |
| isolectotype | 201 | 5.1% |
| type | 197 | 5.0% |
| isosyntype | 103 | 2.6% |
| holotype | 90 | 2.3% |
| paratype | 19 | 0.5% |
| lectotype | 17 | 0.4% |
| cotype | 16 | 0.4% |
| isoneotype | 6 | 0.2% |
| Other values (2) | 7 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 4875 | |
| T | 4145 | |
| E | 4145 | |
| P | 3947 | |
| S | 3679 | |
| O | 3157 | |
| I | 2725 | |
| N | 960 | 3.4% |
| L | 308 | 1.1% |
| C | 234 | 0.8% |
| Other values (3) | 150 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 28325 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 4875 | |
| T | 4145 | |
| E | 4145 | |
| P | 3947 | |
| S | 3679 | |
| O | 3157 | |
| I | 2725 | |
| N | 960 | 3.4% |
| L | 308 | 1.1% |
| C | 234 | 0.8% |
| Other values (3) | 150 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28325 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 4875 | |
| T | 4145 | |
| E | 4145 | |
| P | 3947 | |
| S | 3679 | |
| O | 3157 | |
| I | 2725 | |
| N | 960 | 3.4% |
| L | 308 | 1.1% |
| C | 234 | 0.8% |
| Other values (3) | 150 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28325 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 4875 | |
| T | 4145 | |
| E | 4145 | |
| P | 3947 | |
| S | 3679 | |
| O | 3157 | |
| I | 2725 | |
| N | 960 | 3.4% |
| L | 308 | 1.1% |
| C | 234 | 0.8% |
| Other values (3) | 150 | 0.5% |
identifiedBy
Text
Missing 
| Distinct | 193 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 180415 |
| Missing (%) | 96.7% |
| Memory size | 1.4 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 20 |
| Mean length | 16.31534184 |
| Min length | 5 |
Unique
| Unique | 54 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | Martin C. Van Boskirk |
|---|---|
| 2nd row | Alexander W. Evans |
| 3rd row | Mason E. Hale |
| 4th row | Alexander W. Evans |
| 5th row | M. H. Lewis |
| Value | Count | Frequency (%) |
| w | 1055 | 5.9% |
| alexander | 744 | 4.2% |
| evans | 744 | 4.2% |
| george | 644 | 3.6% |
| f | 634 | 3.6% |
| j | 597 | 3.4% |
| c | 484 | 2.7% |
| k | 480 | 2.7% |
| h | 421 | 2.4% |
| carl | 419 | 2.4% |
| Other values (324) | 11577 |
Most occurring characters
| Value | Count | Frequency (%) |
| 11685 | 11.7% | |
| e | 8430 | 8.5% |
| r | 7926 | 7.9% |
| a | 6479 | 6.5% |
| n | 5716 | 5.7% |
| l | 5553 | 5.6% |
| . | 5495 | 5.5% |
| o | 5002 | 5.0% |
| i | 4237 | 4.2% |
| s | 3571 | 3.6% |
| Other values (44) | 35658 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 64705 | |
| Uppercase Letter | 17846 | 17.9% |
| Space Separator | 11685 | 11.7% |
| Other Punctuation | 5495 | 5.5% |
| Dash Punctuation | 21 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8430 | |
| r | 7926 | |
| a | 6479 | |
| n | 5716 | |
| l | 5553 | |
| o | 5002 | |
| i | 4237 | 6.5% |
| s | 3571 | 5.5% |
| t | 3188 | 4.9% |
| d | 2155 | 3.3% |
| Other values (18) | 12448 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 1964 | |
| A | 1863 | |
| M | 1766 | |
| C | 1598 | 9.0% |
| E | 1480 | 8.3% |
| G | 1140 | 6.4% |
| B | 1130 | 6.3% |
| J | 997 | 5.6% |
| H | 913 | 5.1% |
| F | 862 | 4.8% |
| Other values (13) | 4133 |
Space Separator
| Value | Count | Frequency (%) |
| 11685 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5495 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 82551 | |
| Common | 17201 | 17.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8430 | 10.2% |
| r | 7926 | 9.6% |
| a | 6479 | 7.8% |
| n | 5716 | 6.9% |
| l | 5553 | 6.7% |
| o | 5002 | 6.1% |
| i | 4237 | 5.1% |
| s | 3571 | 4.3% |
| t | 3188 | 3.9% |
| d | 2155 | 2.6% |
| Other values (41) | 30294 |
Common
| Value | Count | Frequency (%) |
| 11685 | ||
| . | 5495 | |
| - | 21 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99635 | |
| None | 117 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 11685 | 11.7% | |
| e | 8430 | 8.5% |
| r | 7926 | 8.0% |
| a | 6479 | 6.5% |
| n | 5716 | 5.7% |
| l | 5553 | 5.6% |
| . | 5495 | 5.5% |
| o | 5002 | 5.0% |
| i | 4237 | 4.3% |
| s | 3571 | 3.6% |
| Other values (42) | 35541 |
None
| Value | Count | Frequency (%) |
| á | 105 | |
| é | 12 | 10.3% |
dateIdentified
Text
Missing 
| Distinct | 85 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 184582 |
| Missing (%) | 99.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 1997-01-01T00:00:00 |
|---|---|
| 2nd row | 1946-01-01T00:00:00 |
| 3rd row | 1946-01-01T00:00:00 |
| 4th row | 1946-01-01T00:00:00 |
| 5th row | 1995-01-01T00:00:00 |
| Value | Count | Frequency (%) |
| 1995-01-01t00:00:00 | 414 | |
| 1997-01-01t00:00:00 | 349 | |
| 1984-01-01t00:00:00 | 135 | 6.9% |
| 1954-01-01t00:00:00 | 102 | 5.2% |
| 1956-01-01t00:00:00 | 80 | 4.1% |
| 1946-01-01t00:00:00 | 63 | 3.2% |
| 1962-01-01t00:00:00 | 61 | 3.1% |
| 1953-01-01t00:00:00 | 60 | 3.1% |
| 1957-01-01t00:00:00 | 54 | 2.8% |
| 1979-01-01t00:00:00 | 52 | 2.7% |
| Other values (75) | 577 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15888 | |
| 1 | 5780 | 15.6% |
| - | 3894 | 10.5% |
| : | 3894 | 10.5% |
| 9 | 2681 | 7.2% |
| T | 1947 | 5.3% |
| 5 | 908 | 2.5% |
| 7 | 588 | 1.6% |
| 8 | 337 | 0.9% |
| 2 | 334 | 0.9% |
| Other values (3) | 742 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 27258 | |
| Dash Punctuation | 3894 | 10.5% |
| Other Punctuation | 3894 | 10.5% |
| Uppercase Letter | 1947 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15888 | |
| 1 | 5780 | 21.2% |
| 9 | 2681 | 9.8% |
| 5 | 908 | 3.3% |
| 7 | 588 | 2.2% |
| 8 | 337 | 1.2% |
| 2 | 334 | 1.2% |
| 4 | 333 | 1.2% |
| 6 | 285 | 1.0% |
| 3 | 124 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3894 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3894 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1947 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 35046 | |
| Latin | 1947 | 5.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15888 | |
| 1 | 5780 | 16.5% |
| - | 3894 | 11.1% |
| : | 3894 | 11.1% |
| 9 | 2681 | 7.6% |
| 5 | 908 | 2.6% |
| 7 | 588 | 1.7% |
| 8 | 337 | 1.0% |
| 2 | 334 | 1.0% |
| 4 | 333 | 1.0% |
| Other values (2) | 409 | 1.2% |
Latin
| Value | Count | Frequency (%) |
| T | 1947 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36993 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15888 | |
| 1 | 5780 | 15.6% |
| - | 3894 | 10.5% |
| : | 3894 | 10.5% |
| 9 | 2681 | 7.2% |
| T | 1947 | 5.3% |
| 5 | 908 | 2.5% |
| 7 | 588 | 1.6% |
| 8 | 337 | 0.9% |
| 2 | 334 | 0.9% |
| Other values (3) | 742 | 2.0% |
Missing 
| Distinct | 2949 |
|---|---|
| Distinct (%) | 79.8% |
| Missing | 182833 |
| Missing (%) | 98.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 283 |
|---|---|
| Median length | 167 |
| Mean length | 48.17316017 |
| Min length | 9 |
Unique
| Unique | 2406 ? |
|---|---|
| Unique (%) | 65.1% |
Sample
| 1st row | Proc. Amer. Acad. Arts. 22: 420. 1887. |
|---|---|
| 2nd row | Mem. Amer. Acad. Arts. n.s. 520. 1862. |
| 3rd row | Pl. Wright. (Grisebach) 1: 173. 1860. |
| 4th row | Proc. Amer. Acad. 22: 428. 1887. |
| 5th row | Proceedings of the American Academy of Arts and Sciences. 7: 381. 1868. |
| Value | Count | Frequency (%) |
| of | 1913 | 6.4% |
| the | 1010 | 3.4% |
| arts | 820 | 2.8% |
| acad | 670 | 2.3% |
| amer | 663 | 2.2% |
| american | 639 | 2.1% |
| and | 627 | 2.1% |
| academy | 614 | 2.1% |
| sciences | 570 | 1.9% |
| proc | 563 | 1.9% |
| Other values (2176) | 21659 |
Most occurring characters
| Value | Count | Frequency (%) |
| 26052 | 14.6% | |
| . | 14090 | 7.9% |
| e | 10440 | 5.9% |
| a | 8607 | 4.8% |
| 1 | 7541 | 4.2% |
| o | 7231 | 4.1% |
| r | 7176 | 4.0% |
| n | 6647 | 3.7% |
| t | 6588 | 3.7% |
| i | 6274 | 3.5% |
| Other values (78) | 77402 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 87279 | |
| Decimal Number | 30944 | 17.4% |
| Space Separator | 26052 | 14.6% |
| Other Punctuation | 17631 | 9.9% |
| Uppercase Letter | 14635 | 8.2% |
| Dash Punctuation | 555 | 0.3% |
| Close Punctuation | 470 | 0.3% |
| Open Punctuation | 470 | 0.3% |
| Math Symbol | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 10440 | |
| a | 8607 | |
| o | 7231 | 8.3% |
| r | 7176 | 8.2% |
| n | 6647 | 7.6% |
| t | 6588 | 7.5% |
| i | 6274 | 7.2% |
| c | 5785 | 6.6% |
| s | 4551 | 5.2% |
| l | 4141 | 4.7% |
| Other values (23) | 19839 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3827 | |
| P | 2135 | |
| S | 1494 | 10.2% |
| C | 1489 | 10.2% |
| B | 1022 | 7.0% |
| G | 628 | 4.3% |
| N | 580 | 4.0% |
| F | 472 | 3.2% |
| M | 426 | 2.9% |
| R | 325 | 2.2% |
| Other values (16) | 2237 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 14090 | |
| : | 2970 | 16.8% |
| , | 359 | 2.0% |
| ; | 90 | 0.5% |
| ' | 75 | 0.4% |
| & | 29 | 0.2% |
| " | 13 | 0.1% |
| # | 2 | < 0.1% |
| / | 2 | < 0.1% |
| ? | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7541 | |
| 8 | 5251 | |
| 6 | 3239 | |
| 2 | 2901 | 9.4% |
| 7 | 2254 | 7.3% |
| 9 | 2082 | 6.7% |
| 3 | 2071 | 6.7% |
| 4 | 1961 | 6.3% |
| 5 | 1948 | 6.3% |
| 0 | 1696 | 5.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 530 | |
| – | 25 | 4.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 311 | |
| ] | 159 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 310 | |
| [ | 160 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 10 | |
| + | 2 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 26052 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 101914 | |
| Common | 76134 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 10440 | 10.2% |
| a | 8607 | 8.4% |
| o | 7231 | 7.1% |
| r | 7176 | 7.0% |
| n | 6647 | 6.5% |
| t | 6588 | 6.5% |
| i | 6274 | 6.2% |
| c | 5785 | 5.7% |
| s | 4551 | 4.5% |
| l | 4141 | 4.1% |
| Other values (49) | 34474 |
Common
| Value | Count | Frequency (%) |
| 26052 | ||
| . | 14090 | |
| 1 | 7541 | 9.9% |
| 8 | 5251 | 6.9% |
| 6 | 3239 | 4.3% |
| : | 2970 | 3.9% |
| 2 | 2901 | 3.8% |
| 7 | 2254 | 3.0% |
| 9 | 2082 | 2.7% |
| 3 | 2071 | 2.7% |
| Other values (19) | 7683 | 10.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 177975 | |
| None | 48 | < 0.1% |
| Punctuation | 25 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 26052 | 14.6% | |
| . | 14090 | 7.9% |
| e | 10440 | 5.9% |
| a | 8607 | 4.8% |
| 1 | 7541 | 4.2% |
| o | 7231 | 4.1% |
| r | 7176 | 4.0% |
| n | 6647 | 3.7% |
| t | 6588 | 3.7% |
| i | 6274 | 3.5% |
| Other values (70) | 77329 |
None
| Value | Count | Frequency (%) |
| ü | 25 | |
| é | 9 | 18.8% |
| ö | 8 | 16.7% |
| è | 2 | 4.2% |
| ä | 2 | 4.2% |
| ë | 1 | 2.1% |
| ñ | 1 | 2.1% |
Punctuation
| Value | Count | Frequency (%) |
| – | 25 |
| Distinct | 13242 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.097645715 |
| Min length | 1 |
Unique
| Unique | 5385 ? |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | 2700991 |
|---|---|
| 2nd row | 3170096 |
| 3rd row | 2728060 |
| 4th row | 4276910 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 6 | 28377 | 15.2% |
| 2721893 | 1395 | 0.7% |
| 2651126 | 1343 | 0.7% |
| 2650111 | 1163 | 0.6% |
| 3196548 | 1155 | 0.6% |
| 2650583 | 1063 | 0.6% |
| 2933951 | 736 | 0.4% |
| 2651736 | 535 | 0.3% |
| 2650888 | 527 | 0.3% |
| 2689220 | 495 | 0.3% |
| Other values (13232) | 149722 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 182570 | |
| 6 | 143478 | |
| 7 | 116217 | |
| 8 | 112612 | |
| 5 | 110350 | |
| 3 | 109965 | |
| 1 | 107204 | |
| 0 | 94336 | |
| 9 | 88487 | |
| 4 | 72059 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1137278 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 182570 | |
| 6 | 143478 | |
| 7 | 116217 | |
| 8 | 112612 | |
| 5 | 110350 | |
| 3 | 109965 | |
| 1 | 107204 | |
| 0 | 94336 | |
| 9 | 88487 | |
| 4 | 72059 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1137278 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 182570 | |
| 6 | 143478 | |
| 7 | 116217 | |
| 8 | 112612 | |
| 5 | 110350 | |
| 3 | 109965 | |
| 1 | 107204 | |
| 0 | 94336 | |
| 9 | 88487 | |
| 4 | 72059 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1137278 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 182570 | |
| 6 | 143478 | |
| 7 | 116217 | |
| 8 | 112612 | |
| 5 | 110350 | |
| 3 | 109965 | |
| 1 | 107204 | |
| 0 | 94336 | |
| 9 | 88487 | |
| 4 | 72059 | 6.3% |
scientificName
Text
| Distinct | 15722 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 87 |
|---|---|
| Median length | 68 |
| Mean length | 24.48391939 |
| Min length | 5 |
Unique
| Unique | 6979 ? |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | Luzula bulbosa (Alph.Wood) Smyth & L.C.R.Smyth |
|---|---|
| 2nd row | Gentiana clausa Raf. |
| 3rd row | Carex muhlenbergii Kunth ex Boott |
| 4th row | Lophocolea minor Nees |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| l | 52365 | 9.0% |
| plantae | 28377 | 4.9% |
| ex | 10744 | 1.8% |
| carex | 8803 | 1.5% |
| 7927 | 1.4% | |
| willd | 5079 | 0.9% |
| hedw | 4951 | 0.8% |
| michx | 4881 | 0.8% |
| dumort | 4636 | 0.8% |
| var | 3379 | 0.6% |
| Other values (13172) | 451450 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 436599 | 9.6% |
| 396063 | 8.7% | |
| i | 310153 | 6.8% |
| e | 285903 | 6.3% |
| l | 247925 | 5.4% |
| r | 240173 | 5.3% |
| n | 217491 | 4.8% |
| . | 202035 | 4.4% |
| o | 199856 | 4.4% |
| u | 192016 | 4.2% |
| Other values (86) | 1838747 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3367734 | |
| Uppercase Letter | 460197 | 10.1% |
| Space Separator | 396063 | 8.7% |
| Other Punctuation | 215275 | 4.7% |
| Close Punctuation | 54697 | 1.2% |
| Open Punctuation | 54697 | 1.2% |
| Decimal Number | 16824 | 0.4% |
| Dash Punctuation | 894 | < 0.1% |
| Math Symbol | 576 | < 0.1% |
| Connector Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 436599 | |
| i | 310153 | 9.2% |
| e | 285903 | 8.5% |
| l | 247925 | 7.4% |
| r | 240173 | 7.1% |
| n | 217491 | 6.5% |
| o | 199856 | 5.9% |
| u | 192016 | 5.7% |
| t | 187781 | 5.6% |
| s | 187105 | 5.6% |
| Other values (35) | 862732 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 77852 | |
| P | 56667 | |
| S | 42264 | 9.2% |
| C | 37080 | 8.1% |
| A | 33196 | 7.2% |
| M | 24632 | 5.4% |
| B | 22344 | 4.9% |
| H | 22082 | 4.8% |
| D | 19711 | 4.3% |
| R | 17098 | 3.7% |
| Other values (21) | 107271 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4792 | |
| 8 | 4211 | |
| 2 | 1967 | |
| 0 | 1687 | 10.0% |
| 3 | 998 | 5.9% |
| 4 | 948 | 5.6% |
| 9 | 681 | 4.0% |
| 7 | 548 | 3.3% |
| 5 | 545 | 3.2% |
| 6 | 447 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 202035 | |
| & | 7927 | 3.7% |
| , | 5244 | 2.4% |
| ' | 69 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 396063 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 54697 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 54697 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 894 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 576 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3827931 | |
| Common | 739030 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 436599 | 11.4% |
| i | 310153 | 8.1% |
| e | 285903 | 7.5% |
| l | 247925 | 6.5% |
| r | 240173 | 6.3% |
| n | 217491 | 5.7% |
| o | 199856 | 5.2% |
| u | 192016 | 5.0% |
| t | 187781 | 4.9% |
| s | 187105 | 4.9% |
| Other values (66) | 1322929 |
Common
| Value | Count | Frequency (%) |
| 396063 | ||
| . | 202035 | |
| ) | 54697 | 7.4% |
| ( | 54697 | 7.4% |
| & | 7927 | 1.1% |
| , | 5244 | 0.7% |
| 1 | 4792 | 0.6% |
| 8 | 4211 | 0.6% |
| 2 | 1967 | 0.3% |
| 0 | 1687 | 0.2% |
| Other values (10) | 5710 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4560557 | |
| None | 6404 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 436599 | 9.6% |
| 396063 | 8.7% | |
| i | 310153 | 6.8% |
| e | 285903 | 6.3% |
| l | 247925 | 5.4% |
| r | 240173 | 5.3% |
| n | 217491 | 4.8% |
| . | 202035 | 4.4% |
| o | 199856 | 4.4% |
| u | 192016 | 4.2% |
| Other values (61) | 1832343 |
None
| Value | Count | Frequency (%) |
| ü | 2543 | |
| ö | 899 | 14.0% |
| ä | 609 | 9.5% |
| é | 590 | 9.2% |
| × | 576 | 9.0% |
| ø | 425 | 6.6% |
| Á | 262 | 4.1% |
| Å | 245 | 3.8% |
| è | 134 | 2.1% |
| á | 44 | 0.7% |
| Other values (15) | 77 | 1.2% |
| Distinct | 792 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 119 |
|---|---|
| Median length | 91 |
| Mean length | 47.98150243 |
| Min length | 5 |
Unique
| Unique | 93 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Plantae; Tracheophyta; Poales; Juncaceae |
|---|---|
| 2nd row | Plantae; Tracheophyta; Asteridae; Gentianales; Gentianaceae |
| 3rd row | Plantae; Tracheophyta; Poales; Cyperaceae |
| 4th row | Plantae; Bryophyta; Hepaticopsida; Jungermanniales; Lophocoleaceae |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 177514 | |
| tracheophyta | 104057 | 13.3% |
| bryophyta | 37100 | 4.7% |
| poales | 23133 | 3.0% |
| hepaticopsida | 21780 | 2.8% |
| asteridae | 20956 | 2.7% |
| rosidae | 18590 | 2.4% |
| jungermanniales | 16103 | 2.1% |
| polypodiales | 14202 | 1.8% |
| cyperaceae | 13776 | 1.8% |
| Other values (1036) | 334890 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1419556 | |
| e | 1079293 | 12.1% |
| ; | 595590 | 6.7% |
| 595590 | 6.7% | |
| t | 468738 | 5.2% |
| l | 453031 | 5.1% |
| o | 447517 | 5.0% |
| c | 388641 | 4.3% |
| r | 357029 | 4.0% |
| h | 342137 | 3.8% |
| Other values (43) | 2801956 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6975797 | |
| Uppercase Letter | 782101 | 8.7% |
| Other Punctuation | 595590 | 6.7% |
| Space Separator | 595590 | 6.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1419556 | |
| e | 1079293 | |
| t | 468738 | 6.7% |
| l | 453031 | 6.5% |
| o | 447517 | 6.4% |
| c | 388641 | 5.6% |
| r | 357029 | 5.1% |
| h | 342137 | 4.9% |
| i | 338965 | 4.9% |
| n | 337540 | 4.8% |
| Other values (16) | 1343350 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 253748 | |
| T | 107375 | |
| A | 69797 | 8.9% |
| B | 65121 | 8.3% |
| C | 46187 | 5.9% |
| R | 42144 | 5.4% |
| F | 32277 | 4.1% |
| H | 30368 | 3.9% |
| L | 25762 | 3.3% |
| J | 22652 | 2.9% |
| Other values (15) | 86670 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 595590 |
Space Separator
| Value | Count | Frequency (%) |
| 595590 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7757898 | |
| Common | 1191180 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1419556 | |
| e | 1079293 | |
| t | 468738 | 6.0% |
| l | 453031 | 5.8% |
| o | 447517 | 5.8% |
| c | 388641 | 5.0% |
| r | 357029 | 4.6% |
| h | 342137 | 4.4% |
| i | 338965 | 4.4% |
| n | 337540 | 4.4% |
| Other values (41) | 2125451 |
Common
| Value | Count | Frequency (%) |
| ; | 595590 | |
| 595590 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8949078 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1419556 | |
| e | 1079293 | 12.1% |
| ; | 595590 | 6.7% |
| 595590 | 6.7% | |
| t | 468738 | 5.2% |
| l | 453031 | 5.1% |
| o | 447517 | 5.0% |
| c | 388641 | 4.3% |
| r | 357029 | 4.0% |
| h | 342137 | 3.8% |
| Other values (43) | 2801956 |
kingdom
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 7 |
| Mean length | 6.981981354 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Plantae |
|---|---|
| 2nd row | Plantae |
| 3rd row | Plantae |
| 4th row | Plantae |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 177496 | |
| fungi | 5161 | 2.8% |
| chromista | 2981 | 1.6% |
| bacteria | 869 | 0.5% |
| incertae | 18 | < 0.1% |
| sedis | 18 | < 0.1% |
| animalia | 2 | < 0.1% |
| protozoa | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 359735 | |
| n | 182677 | |
| t | 181366 | |
| e | 178419 | |
| P | 177498 | |
| l | 177498 | |
| i | 9051 | 0.7% |
| F | 5161 | 0.4% |
| u | 5161 | 0.4% |
| g | 5161 | 0.4% |
| Other values (12) | 20615 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1115813 | |
| Uppercase Letter | 186511 | 14.3% |
| Space Separator | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 359735 | |
| n | 182677 | |
| t | 181366 | |
| e | 178419 | |
| l | 177498 | |
| i | 9051 | 0.8% |
| u | 5161 | 0.5% |
| g | 5161 | 0.5% |
| r | 3870 | 0.3% |
| s | 3017 | 0.3% |
| Other values (6) | 9858 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 177498 | |
| F | 5161 | 2.8% |
| C | 2981 | 1.6% |
| B | 869 | 0.5% |
| A | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1302324 | |
| Common | 18 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 359735 | |
| n | 182677 | |
| t | 181366 | |
| e | 178419 | |
| P | 177498 | |
| l | 177498 | |
| i | 9051 | 0.7% |
| F | 5161 | 0.4% |
| u | 5161 | 0.4% |
| g | 5161 | 0.4% |
| Other values (11) | 20597 | 1.6% |
Common
| Value | Count | Frequency (%) |
| 18 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1302342 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 359735 | |
| n | 182677 | |
| t | 181366 | |
| e | 178419 | |
| P | 177498 | |
| l | 177498 | |
| i | 9051 | 0.7% |
| F | 5161 | 0.4% |
| u | 5161 | 0.4% |
| g | 5161 | 0.4% |
| Other values (12) | 20615 | 1.6% |
phylum
Text
Missing 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 28431 |
| Missing (%) | 15.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 12 |
| Mean length | 11.95495832 |
| Min length | 8 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Tracheophyta |
|---|---|
| 2nd row | Tracheophyta |
| 3rd row | Tracheophyta |
| 4th row | Marchantiophyta |
| 5th row | Tracheophyta |
| Value | Count | Frequency (%) |
| tracheophyta | 104064 | |
| marchantiophyta | 21776 | 13.8% |
| bryophyta | 14896 | 9.4% |
| rhodophyta | 5566 | 3.5% |
| ascomycota | 5121 | 3.2% |
| ochrophyta | 2980 | 1.9% |
| chlorophyta | 1763 | 1.1% |
| cyanobacteria | 867 | 0.5% |
| charophyta | 620 | 0.4% |
| anthocerotophyta | 428 | 0.3% |
| Other values (7) | 17 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 308078 | |
| h | 289292 | |
| t | 180729 | |
| y | 172989 | |
| o | 171415 | |
| p | 152096 | |
| r | 147397 | |
| c | 140372 | |
| e | 105363 | 5.6% |
| T | 104064 | 5.5% |
| Other values (19) | 118260 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1731957 | |
| Uppercase Letter | 158098 | 8.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 308078 | |
| h | 289292 | |
| t | 180729 | |
| y | 172989 | |
| o | 171415 | |
| p | 152096 | |
| r | 147397 | |
| c | 140372 | |
| e | 105363 | 6.1% |
| n | 23074 | 1.3% |
| Other values (9) | 41152 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 104064 | |
| M | 21778 | 13.8% |
| B | 14905 | 9.4% |
| R | 5566 | 3.5% |
| A | 5550 | 3.5% |
| C | 3251 | 2.1% |
| O | 2980 | 1.9% |
| E | 2 | < 0.1% |
| G | 1 | < 0.1% |
| F | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1890055 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 308078 | |
| h | 289292 | |
| t | 180729 | |
| y | 172989 | |
| o | 171415 | |
| p | 152096 | |
| r | 147397 | |
| c | 140372 | |
| e | 105363 | 5.6% |
| T | 104064 | 5.5% |
| Other values (19) | 118260 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1890055 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 308078 | |
| h | 289292 | |
| t | 180729 | |
| y | 172989 | |
| o | 171415 | |
| p | 152096 | |
| r | 147397 | |
| c | 140372 | |
| e | 105363 | 5.6% |
| T | 104064 | 5.5% |
| Other values (19) | 118260 | 6.3% |
class
Text
Missing 
| Distinct | 49 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 28457 |
| Missing (%) | 15.3% |
| Memory size | 1.4 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 12.76794752 |
| Min length | 7 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Liliopsida |
|---|---|
| 2nd row | Magnoliopsida |
| 3rd row | Liliopsida |
| 4th row | Jungermanniopsida |
| 5th row | Magnoliopsida |
| Value | Count | Frequency (%) |
| magnoliopsida | 47673 | |
| liliopsida | 34102 | |
| jungermanniopsida | 19329 | |
| polypodiopsida | 18403 | 11.6% |
| bryopsida | 11812 | 7.5% |
| florideophyceae | 5407 | 3.4% |
| lecanoromycetes | 4619 | 2.9% |
| phaeophyceae | 2903 | 1.8% |
| marchantiopsida | 2447 | 1.5% |
| sphagnopsida | 2360 | 1.5% |
| Other values (39) | 9017 | 5.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 309597 | |
| o | 259227 | |
| a | 237559 | |
| p | 174973 | |
| d | 167339 | |
| s | 146300 | |
| n | 118880 | 5.9% |
| l | 108074 | 5.4% |
| g | 69804 | 3.5% |
| e | 66212 | 3.3% |
| Other values (29) | 360290 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1860183 | |
| Uppercase Letter | 158072 | 7.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 309597 | |
| o | 259227 | |
| a | 237559 | |
| p | 174973 | |
| d | 167339 | |
| s | 146300 | |
| n | 118880 | 6.4% |
| l | 108074 | 5.8% |
| g | 69804 | 3.8% |
| e | 66212 | 3.6% |
| Other values (12) | 202218 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 50120 | |
| L | 40823 | |
| P | 23671 | |
| J | 19329 | 12.2% |
| B | 11983 | 7.6% |
| F | 5407 | 3.4% |
| S | 2396 | 1.5% |
| C | 1691 | 1.1% |
| U | 1428 | 0.9% |
| A | 622 | 0.4% |
| Other values (7) | 602 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2018255 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 309597 | |
| o | 259227 | |
| a | 237559 | |
| p | 174973 | |
| d | 167339 | |
| s | 146300 | |
| n | 118880 | 5.9% |
| l | 108074 | 5.4% |
| g | 69804 | 3.5% |
| e | 66212 | 3.3% |
| Other values (29) | 360290 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2018255 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 309597 | |
| o | 259227 | |
| a | 237559 | |
| p | 174973 | |
| d | 167339 | |
| s | 146300 | |
| n | 118880 | 5.9% |
| l | 108074 | 5.4% |
| g | 69804 | 3.5% |
| e | 66212 | 3.3% |
| Other values (29) | 360290 |
order
Text
Missing 
| Distinct | 249 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 28496 |
| Missing (%) | 15.3% |
| Memory size | 1.4 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 15 |
| Mean length | 9.99117273 |
| Min length | 6 |
Unique
| Unique | 23 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Poales |
|---|---|
| 2nd row | Gentianales |
| 3rd row | Poales |
| 4th row | Jungermanniales |
| 5th row | Lamiales |
| Value | Count | Frequency (%) |
| poales | 23133 | 14.6% |
| polypodiales | 14202 | 9.0% |
| jungermanniales | 11845 | 7.5% |
| asterales | 7806 | 4.9% |
| asparagales | 5708 | 3.6% |
| hypnales | 5685 | 3.6% |
| fabales | 5474 | 3.5% |
| porellales | 5373 | 3.4% |
| lamiales | 4708 | 3.0% |
| rosales | 3883 | 2.5% |
| Other values (239) | 70216 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 241522 | |
| l | 214346 | |
| e | 204168 | |
| s | 189382 | |
| i | 88627 | 5.6% |
| o | 84012 | 5.3% |
| n | 68943 | 4.4% |
| r | 64166 | 4.1% |
| P | 48956 | 3.1% |
| p | 44954 | 2.8% |
| Other values (39) | 329859 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1420900 | |
| Uppercase Letter | 158034 | 10.0% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 241522 | |
| l | 214346 | |
| e | 204168 | |
| s | 189382 | |
| i | 88627 | 6.2% |
| o | 84012 | 5.9% |
| n | 68943 | 4.9% |
| r | 64166 | 4.5% |
| p | 44954 | 3.2% |
| y | 35806 | 2.5% |
| Other values (15) | 184974 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 48956 | |
| A | 18515 | 11.7% |
| J | 11845 | 7.5% |
| L | 10817 | 6.8% |
| C | 10375 | 6.6% |
| F | 9147 | 5.8% |
| M | 8185 | 5.2% |
| H | 6767 | 4.3% |
| S | 6495 | 4.1% |
| R | 6216 | 3.9% |
| Other values (13) | 20716 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1578934 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 241522 | |
| l | 214346 | |
| e | 204168 | |
| s | 189382 | |
| i | 88627 | 5.6% |
| o | 84012 | 5.3% |
| n | 68943 | 4.4% |
| r | 64166 | 4.1% |
| P | 48956 | 3.1% |
| p | 44954 | 2.8% |
| Other values (38) | 329858 |
Common
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1578935 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 241522 | |
| l | 214346 | |
| e | 204168 | |
| s | 189382 | |
| i | 88627 | 5.6% |
| o | 84012 | 5.3% |
| n | 68943 | 4.4% |
| r | 64166 | 4.1% |
| P | 48956 | 3.1% |
| p | 44954 | 2.8% |
| Other values (39) | 329859 |
family
Text
Missing 
| Distinct | 815 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 28710 |
| Missing (%) | 15.4% |
| Memory size | 1.4 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 18 |
| Mean length | 11.48949113 |
| Min length | 7 |
Unique
| Unique | 100 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Juncaceae |
|---|---|
| 2nd row | Gentianaceae |
| 3rd row | Cyperaceae |
| 4th row | Lophocoleaceae |
| 5th row | Phrymaceae |
| Value | Count | Frequency (%) |
| cyperaceae | 13776 | 8.7% |
| asteraceae | 7289 | 4.6% |
| poaceae | 6277 | 4.0% |
| fabaceae | 4763 | 3.0% |
| orchidaceae | 3466 | 2.2% |
| dryopteridaceae | 3452 | 2.2% |
| rosaceae | 3290 | 2.1% |
| pteridaceae | 3008 | 1.9% |
| sphagnaceae | 2360 | 1.5% |
| juncaceae | 2219 | 1.4% |
| Other values (805) | 107919 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 392339 | |
| a | 387850 | |
| c | 194926 | |
| i | 92782 | 5.1% |
| r | 83018 | 4.6% |
| o | 72274 | 4.0% |
| l | 62315 | 3.4% |
| n | 50368 | 2.8% |
| p | 48418 | 2.7% |
| t | 46382 | 2.6% |
| Other values (42) | 382588 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1655439 | |
| Uppercase Letter | 157820 | 8.7% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 392339 | |
| a | 387850 | |
| c | 194926 | |
| i | 92782 | 5.6% |
| r | 83018 | 5.0% |
| o | 72274 | 4.4% |
| l | 62315 | 3.8% |
| n | 50368 | 3.0% |
| p | 48418 | 2.9% |
| t | 46382 | 2.8% |
| Other values (16) | 224767 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 25797 | |
| P | 24928 | |
| A | 20318 | |
| L | 11364 | |
| S | 11029 | |
| R | 9304 | 5.9% |
| F | 8346 | 5.3% |
| O | 7185 | 4.6% |
| D | 6508 | 4.1% |
| B | 6021 | 3.8% |
| Other values (15) | 27020 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1813259 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 392339 | |
| a | 387850 | |
| c | 194926 | |
| i | 92782 | 5.1% |
| r | 83018 | 4.6% |
| o | 72274 | 4.0% |
| l | 62315 | 3.4% |
| n | 50368 | 2.8% |
| p | 48418 | 2.7% |
| t | 46382 | 2.6% |
| Other values (41) | 382587 |
Common
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1813260 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 392339 | |
| a | 387850 | |
| c | 194926 | |
| i | 92782 | 5.1% |
| r | 83018 | 4.6% |
| o | 72274 | 4.0% |
| l | 62315 | 3.4% |
| n | 50368 | 2.8% |
| p | 48418 | 2.7% |
| t | 46382 | 2.6% |
| Other values (42) | 382588 |
genus
Text
Missing 
| Distinct | 4085 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 28788 |
| Missing (%) | 15.4% |
| Memory size | 1.4 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 8.853595451 |
| Min length | 3 |
Unique
| Unique | 1041 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Luzula |
|---|---|
| 2nd row | Gentiana |
| 3rd row | Carex |
| 4th row | Lophocolea |
| 5th row | Mimulus |
| Value | Count | Frequency (%) |
| carex | 8831 | 5.6% |
| sphagnum | 2360 | 1.5% |
| dryopteris | 2269 | 1.4% |
| juncus | 1784 | 1.1% |
| frullania | 1708 | 1.1% |
| asplenium | 1705 | 1.1% |
| scapania | 1518 | 1.0% |
| sargassum | 1505 | 1.0% |
| polypodium | 1375 | 0.9% |
| viola | 1213 | 0.8% |
| Other values (4075) | 133473 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 159567 | 11.4% |
| i | 125119 | 9.0% |
| e | 94983 | 6.8% |
| o | 94015 | 6.7% |
| r | 89664 | 6.4% |
| l | 84777 | 6.1% |
| u | 81100 | 5.8% |
| s | 71024 | 5.1% |
| n | 64715 | 4.6% |
| m | 60888 | 4.4% |
| Other values (43) | 470723 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1238724 | |
| Uppercase Letter | 157741 | 11.3% |
| Dash Punctuation | 110 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 159567 | |
| i | 125119 | 10.1% |
| e | 94983 | 7.7% |
| o | 94015 | 7.6% |
| r | 89664 | 7.2% |
| l | 84777 | 6.8% |
| u | 81100 | 6.5% |
| s | 71024 | 5.7% |
| n | 64715 | 5.2% |
| m | 60888 | 4.9% |
| Other values (16) | 312872 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 25041 | |
| P | 20312 | |
| S | 19066 | |
| A | 14007 | 8.9% |
| L | 9057 | 5.7% |
| D | 8815 | 5.6% |
| R | 6625 | 4.2% |
| M | 6331 | 4.0% |
| E | 6247 | 4.0% |
| B | 5844 | 3.7% |
| Other values (16) | 36396 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 110 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1396465 | |
| Common | 110 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 159567 | 11.4% |
| i | 125119 | 9.0% |
| e | 94983 | 6.8% |
| o | 94015 | 6.7% |
| r | 89664 | 6.4% |
| l | 84777 | 6.1% |
| u | 81100 | 5.8% |
| s | 71024 | 5.1% |
| n | 64715 | 4.6% |
| m | 60888 | 4.4% |
| Other values (42) | 470613 |
Common
| Value | Count | Frequency (%) |
| - | 110 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1396575 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 159567 | 11.4% |
| i | 125119 | 9.0% |
| e | 94983 | 6.8% |
| o | 94015 | 6.7% |
| r | 89664 | 6.4% |
| l | 84777 | 6.1% |
| u | 81100 | 5.8% |
| s | 71024 | 5.1% |
| n | 64715 | 4.6% |
| m | 60888 | 4.4% |
| Other values (43) | 470723 |
genericName
Text
Missing 
| Distinct | 3709 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 28825 |
| Missing (%) | 15.5% |
| Memory size | 1.4 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 8.589090955 |
| Min length | 3 |
Unique
| Unique | 987 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Luzula |
|---|---|
| 2nd row | Gentiana |
| 3rd row | Carex |
| 4th row | Lophocolea |
| 5th row | Mimulus |
| Value | Count | Frequency (%) |
| carex | 8803 | 5.6% |
| sphagnum | 2360 | 1.5% |
| dryopteris | 2266 | 1.4% |
| juncus | 1814 | 1.2% |
| frullania | 1708 | 1.1% |
| asplenium | 1557 | 1.0% |
| scapania | 1517 | 1.0% |
| sargassum | 1504 | 1.0% |
| polypodium | 1453 | 0.9% |
| scirpus | 1280 | 0.8% |
| Other values (3699) | 133442 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 157291 | 11.6% |
| i | 121653 | 9.0% |
| e | 90240 | 6.7% |
| o | 90134 | 6.7% |
| r | 87682 | 6.5% |
| u | 80913 | 6.0% |
| l | 79580 | 5.9% |
| s | 65454 | 4.8% |
| n | 63491 | 4.7% |
| m | 62423 | 4.6% |
| Other values (43) | 455673 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1196830 | |
| Uppercase Letter | 157704 | 11.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 157291 | |
| i | 121653 | |
| e | 90240 | 7.5% |
| o | 90134 | 7.5% |
| r | 87682 | 7.3% |
| u | 80913 | 6.8% |
| l | 79580 | 6.6% |
| s | 65454 | 5.5% |
| n | 63491 | 5.3% |
| m | 62423 | 5.2% |
| Other values (17) | 297969 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 26048 | |
| P | 20798 | |
| S | 16971 | |
| A | 13940 | 8.8% |
| L | 10843 | 6.9% |
| D | 7729 | 4.9% |
| R | 6963 | 4.4% |
| E | 6742 | 4.3% |
| B | 6350 | 4.0% |
| M | 5932 | 3.8% |
| Other values (16) | 35388 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1354534 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 157291 | 11.6% |
| i | 121653 | 9.0% |
| e | 90240 | 6.7% |
| o | 90134 | 6.7% |
| r | 87682 | 6.5% |
| u | 80913 | 6.0% |
| l | 79580 | 5.9% |
| s | 65454 | 4.8% |
| n | 63491 | 4.7% |
| m | 62423 | 4.6% |
| Other values (43) | 455673 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1354533 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 157291 | 11.6% |
| i | 121653 | 9.0% |
| e | 90240 | 6.7% |
| o | 90134 | 6.7% |
| r | 87682 | 6.5% |
| u | 80913 | 6.0% |
| l | 79580 | 5.9% |
| s | 65454 | 4.8% |
| n | 63491 | 4.7% |
| m | 62423 | 4.6% |
| Other values (42) | 455672 |
None
| Value | Count | Frequency (%) |
| ë | 1 |
specificEpithet
Text
Missing 
| Distinct | 6756 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 54371 |
| Missing (%) | 29.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 17 |
| Mean length | 9.061328107 |
| Min length | 3 |
Unique
| Unique | 2228 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | bulbosa |
|---|---|
| 2nd row | clausa |
| 3rd row | muhlenbergii |
| 4th row | minor |
| 5th row | ringens |
| Value | Count | Frequency (%) |
| canadensis | 1478 | 1.1% |
| virginiana | 723 | 0.5% |
| palustris | 710 | 0.5% |
| canadense | 699 | 0.5% |
| americana | 680 | 0.5% |
| virginica | 544 | 0.4% |
| pubescens | 506 | 0.4% |
| virginianum | 501 | 0.4% |
| heterophylla | 501 | 0.4% |
| nemorosa | 495 | 0.4% |
| Other values (6746) | 125321 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 163211 | |
| i | 133654 | |
| l | 84576 | 7.1% |
| s | 84520 | 7.1% |
| e | 82393 | 6.9% |
| r | 79195 | 6.6% |
| u | 77754 | 6.5% |
| n | 74791 | 6.2% |
| t | 65069 | 5.4% |
| o | 63126 | 5.3% |
| Other values (17) | 289238 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1196844 | |
| Dash Punctuation | 683 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 163211 | |
| i | 133654 | |
| l | 84576 | 7.1% |
| s | 84520 | 7.1% |
| e | 82393 | 6.9% |
| r | 79195 | 6.6% |
| u | 77754 | 6.5% |
| n | 74791 | 6.2% |
| t | 65069 | 5.4% |
| o | 63126 | 5.3% |
| Other values (16) | 288555 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 683 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1196844 | |
| Common | 683 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 163211 | |
| i | 133654 | |
| l | 84576 | 7.1% |
| s | 84520 | 7.1% |
| e | 82393 | 6.9% |
| r | 79195 | 6.6% |
| u | 77754 | 6.5% |
| n | 74791 | 6.2% |
| t | 65069 | 5.4% |
| o | 63126 | 5.3% |
| Other values (16) | 288555 |
Common
| Value | Count | Frequency (%) |
| - | 683 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1197527 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 163211 | |
| i | 133654 | |
| l | 84576 | 7.1% |
| s | 84520 | 7.1% |
| e | 82393 | 6.9% |
| r | 79195 | 6.6% |
| u | 77754 | 6.5% |
| n | 74791 | 6.2% |
| t | 65069 | 5.4% |
| o | 63126 | 5.3% |
| Other values (17) | 289238 |
Missing 
| Distinct | 1039 |
|---|---|
| Distinct (%) | 23.8% |
| Missing | 182164 |
| Missing (%) | 97.7% |
| Memory size | 1.4 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 13 |
| Mean length | 9.057961054 |
| Min length | 4 |
Unique
| Unique | 513 ? |
|---|---|
| Unique (%) | 11.8% |
Sample
| 1st row | tenuifolia |
|---|---|
| 2nd row | pauciflorus |
| 3rd row | abbreviatus |
| 4th row | bellidiastrum |
| 5th row | angustatus |
| Value | Count | Frequency (%) |
| rufescens | 97 | 2.2% |
| americana | 73 | 1.7% |
| intermedia | 62 | 1.4% |
| lanceolatum | 58 | 1.3% |
| gigantea | 49 | 1.1% |
| ciliare | 45 | 1.0% |
| elatum | 40 | 0.9% |
| gracilis | 39 | 0.9% |
| variolosa | 39 | 0.9% |
| pubescens | 37 | 0.8% |
| Other values (1029) | 3826 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5377 | |
| i | 4420 | |
| l | 2907 | 7.4% |
| s | 2842 | 7.2% |
| e | 2822 | 7.1% |
| r | 2570 | 6.5% |
| u | 2464 | 6.2% |
| n | 2336 | 5.9% |
| o | 2194 | 5.5% |
| c | 2104 | 5.3% |
| Other values (17) | 9502 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39535 | |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5377 | |
| i | 4420 | |
| l | 2907 | 7.4% |
| s | 2842 | 7.2% |
| e | 2822 | 7.1% |
| r | 2570 | 6.5% |
| u | 2464 | 6.2% |
| n | 2336 | 5.9% |
| o | 2194 | 5.5% |
| c | 2104 | 5.3% |
| Other values (16) | 9499 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39535 | |
| Common | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5377 | |
| i | 4420 | |
| l | 2907 | 7.4% |
| s | 2842 | 7.2% |
| e | 2822 | 7.1% |
| r | 2570 | 6.5% |
| u | 2464 | 6.2% |
| n | 2336 | 5.9% |
| o | 2194 | 5.5% |
| c | 2104 | 5.3% |
| Other values (16) | 9499 |
Common
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39538 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5377 | |
| i | 4420 | |
| l | 2907 | 7.4% |
| s | 2842 | 7.2% |
| e | 2822 | 7.1% |
| r | 2570 | 6.5% |
| u | 2464 | 6.2% |
| n | 2336 | 5.9% |
| o | 2194 | 5.5% |
| c | 2104 | 5.3% |
| Other values (17) | 9502 |
taxonRank
Text
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.728985841 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | KINGDOM |
| Value | Count | Frequency (%) |
| species | 127830 | |
| kingdom | 28426 | 15.2% |
| genus | 25546 | 13.7% |
| variety | 3379 | 1.8% |
| subspecies | 663 | 0.4% |
| form | 323 | 0.2% |
| family | 230 | 0.1% |
| order | 93 | < 0.1% |
| class | 25 | < 0.1% |
| phylum | 14 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 286004 | |
| S | 283245 | |
| I | 160528 | |
| C | 128518 | |
| P | 128507 | |
| G | 53972 | 4.3% |
| N | 53972 | 4.3% |
| M | 28993 | 2.3% |
| O | 28842 | 2.3% |
| D | 28519 | 2.3% |
| Other values (11) | 74051 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1255151 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 286004 | |
| S | 283245 | |
| I | 160528 | |
| C | 128518 | |
| P | 128507 | |
| G | 53972 | 4.3% |
| N | 53972 | 4.3% |
| M | 28993 | 2.3% |
| O | 28842 | 2.3% |
| D | 28519 | 2.3% |
| Other values (11) | 74051 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1255151 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 286004 | |
| S | 283245 | |
| I | 160528 | |
| C | 128518 | |
| P | 128507 | |
| G | 53972 | 4.3% |
| N | 53972 | 4.3% |
| M | 28993 | 2.3% |
| O | 28842 | 2.3% |
| D | 28519 | 2.3% |
| Other values (11) | 74051 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1255151 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 286004 | |
| S | 283245 | |
| I | 160528 | |
| C | 128518 | |
| P | 128507 | |
| G | 53972 | 4.3% |
| N | 53972 | 4.3% |
| M | 28993 | 2.3% |
| O | 28842 | 2.3% |
| D | 28519 | 2.3% |
| Other values (11) | 74051 | 5.9% |
vernacularName
Text
| Distinct | 179 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 98 |
|---|---|
| Median length | 78 |
| Mean length | 29.30921501 |
| Min length | 5 |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | rushes; angiosperms; tracheophytes; plants |
|---|---|
| 2nd row | gentians; angiosperms; tracheophytes; plants |
| 3rd row | sedges; angiosperms; tracheophytes; plants |
| 4th row | liverworts; mosses; plants |
| 5th row | plants; plants |
| Value | Count | Frequency (%) |
| plants | 205898 | |
| tracheophytes | 104057 | |
| angiosperms | 81757 | 14.3% |
| mosses | 38430 | 6.7% |
| liverworts | 21780 | 3.8% |
| sedges | 13776 | 2.4% |
| algae | 7363 | 1.3% |
| sunflowers | 7281 | 1.3% |
| grasses | 6277 | 1.1% |
| ferns | 5822 | 1.0% |
| Other values (217) | 78501 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 752120 | |
| e | 464919 | 8.5% |
| t | 458192 | 8.4% |
| a | 433633 | 7.9% |
| p | 404044 | 7.4% |
| 384431 | 7.0% | |
| ; | 364831 | 6.7% |
| n | 324111 | 5.9% |
| o | 294499 | 5.4% |
| r | 290716 | 5.3% |
| Other values (28) | 1294995 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4711435 | |
| Space Separator | 384431 | 7.0% |
| Other Punctuation | 367273 | 6.7% |
| Dash Punctuation | 2905 | 0.1% |
| Uppercase Letter | 269 | < 0.1% |
| Open Punctuation | 89 | < 0.1% |
| Close Punctuation | 89 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 752120 | |
| e | 464919 | |
| t | 458192 | |
| a | 433633 | |
| p | 404044 | |
| n | 324111 | |
| o | 294499 | 6.3% |
| r | 290716 | 6.2% |
| l | 264824 | 5.6% |
| h | 223345 | 4.7% |
| Other values (15) | 801032 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 78 | |
| G | 62 | |
| J | 54 | |
| B | 47 | |
| P | 27 | 10.0% |
| H | 1 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 364831 | |
| , | 1369 | 0.4% |
| ' | 1073 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 384431 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2905 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 89 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 89 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4711704 | |
| Common | 754787 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 752120 | |
| e | 464919 | |
| t | 458192 | |
| a | 433633 | |
| p | 404044 | |
| n | 324111 | |
| o | 294499 | 6.3% |
| r | 290716 | 6.2% |
| l | 264824 | 5.6% |
| h | 223345 | 4.7% |
| Other values (21) | 801301 |
Common
| Value | Count | Frequency (%) |
| 384431 | ||
| ; | 364831 | |
| - | 2905 | 0.4% |
| , | 1369 | 0.2% |
| ' | 1073 | 0.1% |
| ( | 89 | < 0.1% |
| ) | 89 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5466491 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 752120 | |
| e | 464919 | 8.5% |
| t | 458192 | 8.4% |
| a | 433633 | 7.9% |
| p | 404044 | 7.4% |
| 384431 | 7.0% | |
| ; | 364831 | 6.7% |
| n | 324111 | 5.9% |
| o | 294499 | 5.4% |
| r | 290716 | 5.3% |
| Other values (28) | 1294995 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ICBN |
|---|---|
| 2nd row | ICBN |
| 3rd row | ICBN |
| 4th row | ICBN |
| 5th row | ICBN |
| Value | Count | Frequency (%) |
| icbn | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 186529 | |
| C | 186529 | |
| B | 186529 | |
| N | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 746116 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 186529 | |
| C | 186529 | |
| B | 186529 | |
| N | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 746116 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 186529 | |
| C | 186529 | |
| B | 186529 | |
| N | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 746116 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 186529 | |
| C | 186529 | |
| B | 186529 | |
| N | 186529 |
taxonomicStatus
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.759520886 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | SYNONYM |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 139841 | |
| synonym | 44852 | 24.0% |
| doubtful | 1818 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 279682 | |
| E | 279682 | |
| T | 141659 | |
| D | 141659 | |
| A | 139841 | |
| P | 139841 | |
| Y | 89704 | 6.2% |
| N | 89704 | 6.2% |
| O | 46670 | 3.2% |
| S | 44852 | 3.1% |
| Other values (5) | 53942 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1447236 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 279682 | |
| E | 279682 | |
| T | 141659 | |
| D | 141659 | |
| A | 139841 | |
| P | 139841 | |
| Y | 89704 | 6.2% |
| N | 89704 | 6.2% |
| O | 46670 | 3.2% |
| S | 44852 | 3.1% |
| Other values (5) | 53942 | 3.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1447236 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 279682 | |
| E | 279682 | |
| T | 141659 | |
| D | 141659 | |
| A | 139841 | |
| P | 139841 | |
| Y | 89704 | 6.2% |
| N | 89704 | 6.2% |
| O | 46670 | 3.2% |
| S | 44852 | 3.1% |
| Other values (5) | 53942 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1447236 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 279682 | |
| E | 279682 | |
| T | 141659 | |
| D | 141659 | |
| A | 139841 | |
| P | 139841 | |
| Y | 89704 | 6.2% |
| N | 89704 | 6.2% |
| O | 46670 | 3.2% |
| S | 44852 | 3.1% |
| Other values (5) | 53942 | 3.7% |
taxonRemarks
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 26 |
| Mean length | 26 |
| Min length | 26 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animals and Plants: Plants |
|---|---|
| 2nd row | Animals and Plants: Plants |
| 3rd row | Animals and Plants: Plants |
| 4th row | Animals and Plants: Plants |
| 5th row | Animals and Plants: Plants |
| Value | Count | Frequency (%) |
| plants | 373058 | |
| animals | 186529 | |
| and | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 746116 | |
| a | 746116 | |
| l | 559587 | |
| s | 559587 | |
| 559587 | ||
| P | 373058 | |
| t | 373058 | |
| A | 186529 | 3.8% |
| i | 186529 | 3.8% |
| m | 186529 | 3.8% |
| Other values (2) | 373058 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3544051 | |
| Space Separator | 559587 | 11.5% |
| Uppercase Letter | 559587 | 11.5% |
| Other Punctuation | 186529 | 3.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 746116 | |
| a | 746116 | |
| l | 559587 | |
| s | 559587 | |
| t | 373058 | |
| i | 186529 | 5.3% |
| m | 186529 | 5.3% |
| d | 186529 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 373058 | |
| A | 186529 |
Space Separator
| Value | Count | Frequency (%) |
| 559587 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4103638 | |
| Common | 746116 | 15.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 746116 | |
| a | 746116 | |
| l | 559587 | |
| s | 559587 | |
| P | 373058 | |
| t | 373058 | |
| A | 186529 | 4.5% |
| i | 186529 | 4.5% |
| m | 186529 | 4.5% |
| d | 186529 | 4.5% |
Common
| Value | Count | Frequency (%) |
| 559587 | ||
| : | 186529 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4849754 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 746116 | |
| a | 746116 | |
| l | 559587 | |
| s | 559587 | |
| 559587 | ||
| P | 373058 | |
| t | 373058 | |
| A | 186529 | 3.8% |
| i | 186529 | 3.8% |
| m | 186529 | 3.8% |
| Other values (2) | 373058 |
datasetKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 963f12d0-f762-11e1-a439-00145eb45e9a |
|---|---|
| 2nd row | 963f12d0-f762-11e1-a439-00145eb45e9a |
| 3rd row | 963f12d0-f762-11e1-a439-00145eb45e9a |
| 4th row | 963f12d0-f762-11e1-a439-00145eb45e9a |
| 5th row | 963f12d0-f762-11e1-a439-00145eb45e9a |
| Value | Count | Frequency (%) |
| 963f12d0-f762-11e1-a439-00145eb45e9a | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 932645 | |
| - | 746116 | |
| 9 | 559587 | |
| 0 | 559587 | |
| e | 559587 | |
| 4 | 559587 | |
| 6 | 373058 | 5.6% |
| 3 | 373058 | 5.6% |
| f | 373058 | 5.6% |
| 2 | 373058 | 5.6% |
| Other values (5) | 1305703 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4290167 | |
| Lowercase Letter | 1678761 | 25.0% |
| Dash Punctuation | 746116 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 932645 | |
| 9 | 559587 | |
| 0 | 559587 | |
| 4 | 559587 | |
| 6 | 373058 | 8.7% |
| 3 | 373058 | 8.7% |
| 2 | 373058 | 8.7% |
| 5 | 373058 | 8.7% |
| 7 | 186529 | 4.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 559587 | |
| f | 373058 | |
| a | 373058 | |
| d | 186529 | 11.1% |
| b | 186529 | 11.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 746116 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5036283 | |
| Latin | 1678761 | 25.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 932645 | |
| - | 746116 | |
| 9 | 559587 | |
| 0 | 559587 | |
| 4 | 559587 | |
| 6 | 373058 | 7.4% |
| 3 | 373058 | 7.4% |
| 2 | 373058 | 7.4% |
| 5 | 373058 | 7.4% |
| 7 | 186529 | 3.7% |
Latin
| Value | Count | Frequency (%) |
| e | 559587 | |
| f | 373058 | |
| a | 373058 | |
| d | 186529 | 11.1% |
| b | 186529 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6715044 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 932645 | |
| - | 746116 | |
| 9 | 559587 | |
| 0 | 559587 | |
| e | 559587 | |
| 4 | 559587 | |
| 6 | 373058 | 5.6% |
| 3 | 373058 | 5.6% |
| f | 373058 | 5.6% |
| 2 | 373058 | 5.6% |
| Other values (5) | 1305703 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 186529 | |
| S | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 373058 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 186529 | |
| S | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 373058 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 186529 | |
| S | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 373058 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 186529 | |
| S | 186529 |
lastInterpreted
Text
| Distinct | 18062 |
|---|---|
| Distinct (%) | 9.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99661179 |
| Min length | 20 |
Unique
| Unique | 1510 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 2025-01-07T13:09:13.317Z |
|---|---|
| 2nd row | 2025-01-07T13:09:13.317Z |
| 3rd row | 2025-01-07T13:09:13.317Z |
| 4th row | 2025-01-07T13:09:13.317Z |
| 5th row | 2025-01-07T13:09:13.318Z |
| Value | Count | Frequency (%) |
| 2025-01-07t13:09:09.691z | 61 | < 0.1% |
| 2025-01-07t13:09:14.842z | 55 | < 0.1% |
| 2025-01-07t13:09:13.153z | 54 | < 0.1% |
| 2025-01-07t13:09:14.841z | 53 | < 0.1% |
| 2025-01-07t13:09:14.268z | 52 | < 0.1% |
| 2025-01-07t13:09:10.471z | 51 | < 0.1% |
| 2025-01-07t13:09:13.231z | 51 | < 0.1% |
| 2025-01-07t13:09:13.898z | 51 | < 0.1% |
| 2025-01-07t13:09:12.942z | 51 | < 0.1% |
| 2025-01-07t13:09:14.350z | 50 | < 0.1% |
| Other values (18052) | 186000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 902065 | |
| 1 | 544038 | |
| 2 | 450890 | |
| - | 373058 | |
| : | 373058 | |
| 3 | 270263 | 6.0% |
| 5 | 267401 | 6.0% |
| 7 | 257877 | 5.8% |
| 9 | 254149 | 5.7% |
| T | 186529 | 4.2% |
| Other values (5) | 596736 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3170519 | |
| Other Punctuation | 559429 | 12.5% |
| Dash Punctuation | 373058 | 8.3% |
| Uppercase Letter | 373058 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 902065 | |
| 1 | 544038 | |
| 2 | 450890 | |
| 3 | 270263 | 8.5% |
| 5 | 267401 | 8.4% |
| 7 | 257877 | 8.1% |
| 9 | 254149 | 8.0% |
| 4 | 80989 | 2.6% |
| 8 | 73891 | 2.3% |
| 6 | 68956 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 373058 | |
| . | 186371 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 186529 | |
| Z | 186529 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 373058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4103006 | |
| Latin | 373058 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 902065 | |
| 1 | 544038 | |
| 2 | 450890 | |
| - | 373058 | |
| : | 373058 | |
| 3 | 270263 | 6.6% |
| 5 | 267401 | 6.5% |
| 7 | 257877 | 6.3% |
| 9 | 254149 | 6.2% |
| . | 186371 | 4.5% |
| Other values (3) | 223836 | 5.5% |
Latin
| Value | Count | Frequency (%) |
| T | 186529 | |
| Z | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4476064 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 902065 | |
| 1 | 544038 | |
| 2 | 450890 | |
| - | 373058 | |
| : | 373058 | |
| 3 | 270263 | 6.0% |
| 5 | 267401 | 6.0% |
| 7 | 257877 | 5.8% |
| 9 | 254149 | 5.7% |
| T | 186529 | 4.2% |
| Other values (5) | 596736 |
elevation
Text
Missing 
| Distinct | 751 |
|---|---|
| Distinct (%) | 9.9% |
| Missing | 178933 |
| Missing (%) | 95.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.40297525 |
| Min length | 3 |
Unique
| Unique | 278 ? |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | 564.0 |
|---|---|
| 2nd row | 1500.0 |
| 3rd row | 1012.0 |
| 4th row | 137.0 |
| 5th row | 1463.0 |
| Value | Count | Frequency (%) |
| 1524.0 | 271 | 3.6% |
| 305.0 | 236 | 3.1% |
| 1219.0 | 194 | 2.6% |
| 1829.0 | 188 | 2.5% |
| 366.0 | 170 | 2.2% |
| 914.0 | 168 | 2.2% |
| 610.0 | 162 | 2.1% |
| 2743.0 | 156 | 2.1% |
| 762.0 | 151 | 2.0% |
| 244.0 | 150 | 2.0% |
| Other values (741) | 5750 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10709 | |
| . | 7596 | |
| 1 | 4465 | |
| 2 | 3932 | 9.6% |
| 5 | 2676 | 6.5% |
| 3 | 2484 | 6.1% |
| 4 | 2352 | 5.7% |
| 6 | 1935 | 4.7% |
| 7 | 1714 | 4.2% |
| 8 | 1661 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 33445 | |
| Other Punctuation | 7596 | 18.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10709 | |
| 1 | 4465 | |
| 2 | 3932 | 11.8% |
| 5 | 2676 | 8.0% |
| 3 | 2484 | 7.4% |
| 4 | 2352 | 7.0% |
| 6 | 1935 | 5.8% |
| 7 | 1714 | 5.1% |
| 8 | 1661 | 5.0% |
| 9 | 1517 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7596 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 41041 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 10709 | |
| . | 7596 | |
| 1 | 4465 | |
| 2 | 3932 | 9.6% |
| 5 | 2676 | 6.5% |
| 3 | 2484 | 6.1% |
| 4 | 2352 | 5.7% |
| 6 | 1935 | 4.7% |
| 7 | 1714 | 4.2% |
| 8 | 1661 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41041 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 10709 | |
| . | 7596 | |
| 1 | 4465 | |
| 2 | 3932 | 9.6% |
| 5 | 2676 | 6.5% |
| 3 | 2484 | 6.1% |
| 4 | 2352 | 5.7% |
| 6 | 1935 | 4.7% |
| 7 | 1714 | 4.2% |
| 8 | 1661 | 4.0% |
Missing 
| Distinct | 77 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 185793 |
| Missing (%) | 99.6% |
| Memory size | 1.4 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.486413043 |
| Min length | 3 |
Unique
| Unique | 28 ? |
|---|---|
| Unique (%) | 3.8% |
Sample
| 1st row | 50.0 |
|---|---|
| 2nd row | 150.0 |
| 3rd row | 50.0 |
| 4th row | 50.0 |
| 5th row | 172.5 |
| Value | Count | Frequency (%) |
| 50.0 | 100 | 13.6% |
| 327.5 | 62 | 8.4% |
| 100.0 | 59 | 8.0% |
| 152.5 | 57 | 7.7% |
| 0.0 | 48 | 6.5% |
| 150.0 | 40 | 5.4% |
| 390.0 | 26 | 3.5% |
| 76.0 | 24 | 3.3% |
| 62.5 | 23 | 3.1% |
| 381.0 | 19 | 2.6% |
| Other values (67) | 278 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 973 | |
| . | 736 | |
| 5 | 571 | |
| 1 | 242 | 7.3% |
| 2 | 239 | 7.2% |
| 3 | 186 | 5.6% |
| 7 | 159 | 4.8% |
| 6 | 80 | 2.4% |
| 4 | 43 | 1.3% |
| 8 | 38 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2566 | |
| Other Punctuation | 736 | 22.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 973 | |
| 5 | 571 | |
| 1 | 242 | 9.4% |
| 2 | 239 | 9.3% |
| 3 | 186 | 7.2% |
| 7 | 159 | 6.2% |
| 6 | 80 | 3.1% |
| 4 | 43 | 1.7% |
| 8 | 38 | 1.5% |
| 9 | 35 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 736 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3302 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 973 | |
| . | 736 | |
| 5 | 571 | |
| 1 | 242 | 7.3% |
| 2 | 239 | 7.2% |
| 3 | 186 | 5.6% |
| 7 | 159 | 4.8% |
| 6 | 80 | 2.4% |
| 4 | 43 | 1.3% |
| 8 | 38 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3302 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 973 | |
| . | 736 | |
| 5 | 571 | |
| 1 | 242 | 7.3% |
| 2 | 239 | 7.2% |
| 3 | 186 | 5.6% |
| 7 | 159 | 4.8% |
| 6 | 80 | 2.4% |
| 4 | 43 | 1.3% |
| 8 | 38 | 1.2% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 67 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 186092 |
| Missing (%) | 99.8% |
| Memory size | 1.4 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 15.97482838 |
| Min length | 3 |
Unique
| Unique | 31 ? |
|---|---|
| Unique (%) | 7.1% |
Sample
| 1st row | 1170.0899523613987 |
|---|---|
| 2nd row | 4974.498988381608 |
| 3rd row | 4974.498988381608 |
| 4th row | 4131.791168613916 |
| 5th row | 2047.6989123381013 |
| Value | Count | Frequency (%) |
| 0.0 | 51 | 11.7% |
| 2589.9343731029417 | 42 | 9.6% |
| 4974.498988381608 | 27 | 6.2% |
| 1360.0074314533344 | 26 | 5.9% |
| 2047.6989123381013 | 25 | 5.7% |
| 1632.4374102813665 | 23 | 5.3% |
| 3512.1947738856975 | 21 | 4.8% |
| 2503.4790916570705 | 17 | 3.9% |
| 2092.6375926612645 | 15 | 3.4% |
| 911.6597020315339 | 15 | 3.4% |
| Other values (57) | 175 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 864 | |
| 1 | 814 | |
| 0 | 738 | |
| 9 | 678 | |
| 2 | 612 | |
| 4 | 611 | |
| 7 | 602 | |
| 6 | 563 | |
| 8 | 547 | |
| 5 | 515 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6544 | |
| Other Punctuation | 437 | 6.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 864 | |
| 1 | 814 | |
| 0 | 738 | |
| 9 | 678 | |
| 2 | 612 | |
| 4 | 611 | |
| 7 | 602 | |
| 6 | 563 | |
| 8 | 547 | |
| 5 | 515 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6981 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 864 | |
| 1 | 814 | |
| 0 | 738 | |
| 9 | 678 | |
| 2 | 612 | |
| 4 | 611 | |
| 7 | 602 | |
| 6 | 563 | |
| 8 | 547 | |
| 5 | 515 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6981 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 864 | |
| 1 | 814 | |
| 0 | 738 | |
| 9 | 678 | |
| 2 | 612 | |
| 4 | 611 | |
| 7 | 602 | |
| 6 | 563 | |
| 8 | 547 | |
| 5 | 515 |
issue
Text
| Distinct | 63 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 193 |
|---|---|
| Median length | 95 |
| Mean length | 97.25331182 |
| Min length | 95 |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY;COLLECTION_MATCH_FUZZY |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY;COLLECTION_MATCH_FUZZY |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY;COLLECTION_MATCH_FUZZY |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY;COLLECTION_MATCH_FUZZY |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY;COLLECTION_MATCH_FUZZY |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count;institution_match_fuzzy;collection_match_fuzzy | 169646 | |
| occurrence_status_inferred_from_individual_count;coordinate_rounded;institution_match_fuzzy;collection_match_fuzzy | 5872 | 3.1% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank;institution_match_fuzzy;collection_match_fuzzy | 2416 | 1.3% |
| occurrence_status_inferred_from_individual_count;recorded_date_mismatch;institution_match_fuzzy;collection_match_fuzzy | 2089 | 1.1% |
| occurrence_status_inferred_from_individual_count;taxon_match_fuzzy;institution_match_fuzzy;collection_match_fuzzy | 1835 | 1.0% |
| occurrence_status_inferred_from_individual_count;continent_coordinate_mismatch;institution_match_fuzzy;collection_match_fuzzy | 1078 | 0.6% |
| occurrence_status_inferred_from_individual_count;country_derived_from_coordinates;institution_match_fuzzy;collection_match_fuzzy | 939 | 0.5% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_coordinates;institution_match_fuzzy;collection_match_fuzzy | 771 | 0.4% |
| occurrence_status_inferred_from_individual_count;coordinate_reprojected;institution_match_fuzzy;collection_match_fuzzy | 279 | 0.1% |
| occurrence_status_inferred_from_individual_count;coordinate_rounded;taxon_match_higherrank;institution_match_fuzzy;collection_match_fuzzy | 198 | 0.1% |
| Other values (53) | 1406 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 1713116 | 9.4% |
| _ | 1711007 | 9.4% |
| C | 1518846 | 8.4% |
| I | 1514962 | 8.4% |
| N | 1340024 | 7.4% |
| U | 1316104 | 7.3% |
| O | 1160937 | 6.4% |
| E | 968295 | 5.3% |
| R | 966824 | 5.3% |
| A | 776170 | 4.3% |
| Other values (18) | 5154278 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16037928 | |
| Connector Punctuation | 1711007 | 9.4% |
| Other Punctuation | 391262 | 2.2% |
| Decimal Number | 366 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1713116 | |
| C | 1518846 | 9.5% |
| I | 1514962 | 9.4% |
| N | 1340024 | 8.4% |
| U | 1316104 | 8.2% |
| O | 1160937 | 7.2% |
| E | 968295 | 6.0% |
| R | 966824 | 6.0% |
| A | 776170 | 4.8% |
| F | 750473 | 4.7% |
| Other values (14) | 4012177 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 183 | |
| 4 | 183 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1711007 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 391262 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16037928 | |
| Common | 2102635 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 1713116 | |
| C | 1518846 | 9.5% |
| I | 1514962 | 9.4% |
| N | 1340024 | 8.4% |
| U | 1316104 | 8.2% |
| O | 1160937 | 7.2% |
| E | 968295 | 6.0% |
| R | 966824 | 6.0% |
| A | 776170 | 4.8% |
| F | 750473 | 4.7% |
| Other values (14) | 4012177 |
Common
| Value | Count | Frequency (%) |
| _ | 1711007 | |
| ; | 391262 | 18.6% |
| 8 | 183 | < 0.1% |
| 4 | 183 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18140563 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 1713116 | 9.4% |
| _ | 1711007 | 9.4% |
| C | 1518846 | 8.4% |
| I | 1514962 | 8.4% |
| N | 1340024 | 7.4% |
| U | 1316104 | 7.3% |
| O | 1160937 | 6.4% |
| E | 968295 | 5.3% |
| R | 966824 | 5.3% |
| A | 776170 | 4.3% |
| Other values (18) | 5154278 |
mediaType
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9347 |
| Missing (%) | 5.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 98 |
|---|---|
| Median length | 10 |
| Mean length | 11.46975426 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage;StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 155884 | |
| stillimage;stillimage | 19435 | 11.0% |
| stillimage;stillimage;stillimage | 1484 | 0.8% |
| stillimage;stillimage;stillimage;stillimage | 290 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 63 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 15 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 6 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 3 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 401712 | |
| S | 200856 | |
| t | 200856 | |
| i | 200856 | |
| I | 200856 | |
| m | 200856 | |
| a | 200856 | |
| g | 200856 | |
| e | 200856 | |
| ; | 23674 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1606848 | |
| Uppercase Letter | 401712 | 19.8% |
| Other Punctuation | 23674 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 401712 | |
| t | 200856 | |
| i | 200856 | |
| m | 200856 | |
| a | 200856 | |
| g | 200856 | |
| e | 200856 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 200856 | |
| I | 200856 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 23674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2008560 | |
| Common | 23674 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 401712 | |
| S | 200856 | |
| t | 200856 | |
| i | 200856 | |
| I | 200856 | |
| m | 200856 | |
| a | 200856 | |
| g | 200856 | |
| e | 200856 |
Common
| Value | Count | Frequency (%) |
| ; | 23674 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2032234 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 401712 | |
| S | 200856 | |
| t | 200856 | |
| i | 200856 | |
| I | 200856 | |
| m | 200856 | |
| a | 200856 | |
| g | 200856 | |
| e | 200856 | |
| ; | 23674 | 1.2% |
hasCoordinate
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.440146036 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | true |
| 3rd row | true |
| 4th row | true |
| 5th row | false |
| Value | Count | Frequency (%) |
| true | 104429 | |
| false | 82100 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 186529 | |
| t | 104429 | |
| r | 104429 | |
| u | 104429 | |
| f | 82100 | |
| a | 82100 | |
| l | 82100 | |
| s | 82100 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 828216 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 186529 | |
| t | 104429 | |
| r | 104429 | |
| u | 104429 | |
| f | 82100 | |
| a | 82100 | |
| l | 82100 | |
| s | 82100 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 828216 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 186529 | |
| t | 104429 | |
| r | 104429 | |
| u | 104429 | |
| f | 82100 | |
| a | 82100 | |
| l | 82100 | |
| s | 82100 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 828216 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 186529 | |
| t | 104429 | |
| r | 104429 | |
| u | 104429 | |
| f | 82100 | |
| a | 82100 | |
| l | 82100 | |
| s | 82100 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.999555029 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 186446 | |
| true | 83 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 186529 | |
| f | 186446 | |
| a | 186446 | |
| l | 186446 | |
| s | 186446 | |
| t | 83 | < 0.1% |
| r | 83 | < 0.1% |
| u | 83 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 932562 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 186529 | |
| f | 186446 | |
| a | 186446 | |
| l | 186446 | |
| s | 186446 | |
| t | 83 | < 0.1% |
| r | 83 | < 0.1% |
| u | 83 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 932562 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 186529 | |
| f | 186446 | |
| a | 186446 | |
| l | 186446 | |
| s | 186446 | |
| t | 83 | < 0.1% |
| r | 83 | < 0.1% |
| u | 83 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 932562 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 186529 | |
| f | 186446 | |
| a | 186446 | |
| l | 186446 | |
| s | 186446 | |
| t | 83 | < 0.1% |
| r | 83 | < 0.1% |
| u | 83 | < 0.1% |
taxonKey
Text
| Distinct | 15722 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.088613567 |
| Min length | 1 |
Unique
| Unique | 6979 ? |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | 2700991 |
|---|---|
| 2nd row | 3170096 |
| 3rd row | 2728062 |
| 4th row | 4276910 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 6 | 28377 | 15.2% |
| 2721893 | 1378 | 0.7% |
| 2651126 | 1339 | 0.7% |
| 2650111 | 1163 | 0.6% |
| 3196548 | 1155 | 0.6% |
| 2650583 | 961 | 0.5% |
| 2933951 | 736 | 0.4% |
| 2651736 | 535 | 0.3% |
| 2650888 | 527 | 0.3% |
| 4277138 | 495 | 0.3% |
| Other values (15712) | 149863 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 179759 | |
| 6 | 140693 | |
| 7 | 120853 | |
| 5 | 113020 | |
| 3 | 112168 | |
| 8 | 106777 | |
| 1 | 105406 | |
| 0 | 93031 | |
| 9 | 84742 | |
| 4 | 79254 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1135703 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 179759 | |
| 6 | 140693 | |
| 7 | 120853 | |
| 5 | 113020 | |
| 3 | 112168 | |
| 8 | 106777 | |
| 1 | 105406 | |
| 0 | 93031 | |
| 9 | 84742 | |
| 4 | 79254 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1135703 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 179759 | |
| 6 | 140693 | |
| 7 | 120853 | |
| 5 | 113020 | |
| 3 | 112168 | |
| 8 | 106777 | |
| 1 | 105406 | |
| 0 | 93031 | |
| 9 | 84742 | |
| 4 | 79254 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1135703 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 179759 | |
| 6 | 140693 | |
| 7 | 120853 | |
| 5 | 113020 | |
| 3 | 112168 | |
| 8 | 106777 | |
| 1 | 105406 | |
| 0 | 93031 | |
| 9 | 84742 | |
| 4 | 79254 |
acceptedTaxonKey
Text
| Distinct | 13242 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.097645715 |
| Min length | 1 |
Unique
| Unique | 5385 ? |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | 2700991 |
|---|---|
| 2nd row | 3170096 |
| 3rd row | 2728060 |
| 4th row | 4276910 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 6 | 28377 | 15.2% |
| 2721893 | 1395 | 0.7% |
| 2651126 | 1343 | 0.7% |
| 2650111 | 1163 | 0.6% |
| 3196548 | 1155 | 0.6% |
| 2650583 | 1063 | 0.6% |
| 2933951 | 736 | 0.4% |
| 2651736 | 535 | 0.3% |
| 2650888 | 527 | 0.3% |
| 2689220 | 495 | 0.3% |
| Other values (13232) | 149722 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 182570 | |
| 6 | 143478 | |
| 7 | 116217 | |
| 8 | 112612 | |
| 5 | 110350 | |
| 3 | 109965 | |
| 1 | 107204 | |
| 0 | 94336 | |
| 9 | 88487 | |
| 4 | 72059 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1137278 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 182570 | |
| 6 | 143478 | |
| 7 | 116217 | |
| 8 | 112612 | |
| 5 | 110350 | |
| 3 | 109965 | |
| 1 | 107204 | |
| 0 | 94336 | |
| 9 | 88487 | |
| 4 | 72059 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1137278 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 182570 | |
| 6 | 143478 | |
| 7 | 116217 | |
| 8 | 112612 | |
| 5 | 110350 | |
| 3 | 109965 | |
| 1 | 107204 | |
| 0 | 94336 | |
| 9 | 88487 | |
| 4 | 72059 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1137278 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 182570 | |
| 6 | 143478 | |
| 7 | 116217 | |
| 8 | 112612 | |
| 5 | 110350 | |
| 3 | 109965 | |
| 1 | 107204 | |
| 0 | 94336 | |
| 9 | 88487 | |
| 4 | 72059 | 6.3% |
kingdomKey
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | 6 |
| 3rd row | 6 |
| 4th row | 6 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 6 | 177496 | |
| 5 | 5161 | 2.8% |
| 4 | 2981 | 1.6% |
| 3 | 869 | 0.5% |
| 0 | 18 | < 0.1% |
| 1 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 177496 | |
| 5 | 5161 | 2.8% |
| 4 | 2981 | 1.6% |
| 3 | 869 | 0.5% |
| 0 | 18 | < 0.1% |
| 1 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 186529 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 177496 | |
| 5 | 5161 | 2.8% |
| 4 | 2981 | 1.6% |
| 3 | 869 | 0.5% |
| 0 | 18 | < 0.1% |
| 1 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 186529 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 177496 | |
| 5 | 5161 | 2.8% |
| 4 | 2981 | 1.6% |
| 3 | 869 | 0.5% |
| 0 | 18 | < 0.1% |
| 1 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 186529 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 177496 | |
| 5 | 5161 | 2.8% |
| 4 | 2981 | 1.6% |
| 3 | 869 | 0.5% |
| 0 | 18 | < 0.1% |
| 1 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
phylumKey
Text
Missing 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 28431 |
| Missing (%) | 15.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 5.208237928 |
| Min length | 1 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 7707728 |
|---|---|
| 2nd row | 7707728 |
| 3rd row | 7707728 |
| 4th row | 9 |
| 5th row | 7707728 |
| Value | Count | Frequency (%) |
| 7707728 | 104064 | |
| 9 | 21776 | 13.8% |
| 35 | 14896 | 9.4% |
| 106 | 5566 | 3.5% |
| 95 | 5121 | 3.2% |
| 98 | 2980 | 1.9% |
| 36 | 1763 | 1.1% |
| 68 | 867 | 0.5% |
| 7819616 | 620 | 0.4% |
| 13 | 428 | 0.3% |
| Other values (7) | 17 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 416878 | |
| 0 | 109631 | 13.3% |
| 8 | 108533 | 13.2% |
| 2 | 104066 | 12.6% |
| 9 | 30498 | 3.7% |
| 5 | 20020 | 2.4% |
| 3 | 17099 | 2.1% |
| 6 | 9438 | 1.1% |
| 1 | 7238 | 0.9% |
| 4 | 11 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 823412 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 416878 | |
| 0 | 109631 | 13.3% |
| 8 | 108533 | 13.2% |
| 2 | 104066 | 12.6% |
| 9 | 30498 | 3.7% |
| 5 | 20020 | 2.4% |
| 3 | 17099 | 2.1% |
| 6 | 9438 | 1.1% |
| 1 | 7238 | 0.9% |
| 4 | 11 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 823412 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 416878 | |
| 0 | 109631 | 13.3% |
| 8 | 108533 | 13.2% |
| 2 | 104066 | 12.6% |
| 9 | 30498 | 3.7% |
| 5 | 20020 | 2.4% |
| 3 | 17099 | 2.1% |
| 6 | 9438 | 1.1% |
| 1 | 7238 | 0.9% |
| 4 | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 823412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 416878 | |
| 0 | 109631 | 13.3% |
| 8 | 108533 | 13.2% |
| 2 | 104066 | 12.6% |
| 9 | 30498 | 3.7% |
| 5 | 20020 | 2.4% |
| 3 | 17099 | 2.1% |
| 6 | 9438 | 1.1% |
| 1 | 7238 | 0.9% |
| 4 | 11 | < 0.1% |
classKey
Text
Missing 
| Distinct | 49 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 28457 |
| Missing (%) | 15.3% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.594324105 |
| Min length | 3 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 196 |
|---|---|
| 2nd row | 220 |
| 3rd row | 196 |
| 4th row | 126 |
| 5th row | 220 |
| Value | Count | Frequency (%) |
| 220 | 47673 | |
| 196 | 34102 | |
| 126 | 19329 | |
| 7228684 | 18403 | 11.6% |
| 327 | 11812 | 7.5% |
| 342 | 5407 | 3.4% |
| 180 | 4619 | 2.9% |
| 7073593 | 2903 | 1.8% |
| 125 | 2447 | 1.5% |
| 190 | 2360 | 1.5% |
| Other values (39) | 9017 | 5.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 176119 | |
| 6 | 73133 | |
| 1 | 69453 | 12.2% |
| 0 | 58587 | 10.3% |
| 9 | 43889 | 7.7% |
| 8 | 42426 | 7.5% |
| 7 | 38983 | 6.9% |
| 4 | 28699 | 5.1% |
| 3 | 26684 | 4.7% |
| 5 | 10189 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 568162 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 176119 | |
| 6 | 73133 | |
| 1 | 69453 | 12.2% |
| 0 | 58587 | 10.3% |
| 9 | 43889 | 7.7% |
| 8 | 42426 | 7.5% |
| 7 | 38983 | 6.9% |
| 4 | 28699 | 5.1% |
| 3 | 26684 | 4.7% |
| 5 | 10189 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 568162 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 176119 | |
| 6 | 73133 | |
| 1 | 69453 | 12.2% |
| 0 | 58587 | 10.3% |
| 9 | 43889 | 7.7% |
| 8 | 42426 | 7.5% |
| 7 | 38983 | 6.9% |
| 4 | 28699 | 5.1% |
| 3 | 26684 | 4.7% |
| 5 | 10189 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 568162 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 176119 | |
| 6 | 73133 | |
| 1 | 69453 | 12.2% |
| 0 | 58587 | 10.3% |
| 9 | 43889 | 7.7% |
| 8 | 42426 | 7.5% |
| 7 | 38983 | 6.9% |
| 4 | 28699 | 5.1% |
| 3 | 26684 | 4.7% |
| 5 | 10189 | 1.8% |
orderKey
Text
Missing 
| Distinct | 249 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 28496 |
| Missing (%) | 15.3% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.62612872 |
| Min length | 3 |
Unique
| Unique | 23 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1369 |
|---|---|
| 2nd row | 412 |
| 3rd row | 1369 |
| 4th row | 381 |
| 5th row | 408 |
| Value | Count | Frequency (%) |
| 1369 | 23133 | 14.6% |
| 392 | 14202 | 9.0% |
| 381 | 11845 | 7.5% |
| 414 | 7806 | 4.9% |
| 1169 | 5708 | 3.6% |
| 617 | 5685 | 3.6% |
| 1370 | 5474 | 3.5% |
| 377 | 5373 | 3.4% |
| 408 | 4708 | 3.0% |
| 691 | 3883 | 2.5% |
| Other values (239) | 70216 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 120264 | |
| 3 | 93801 | |
| 9 | 66033 | |
| 6 | 64887 | |
| 4 | 53476 | |
| 2 | 51228 | |
| 7 | 38365 | 6.7% |
| 0 | 29712 | 5.2% |
| 8 | 27679 | 4.8% |
| 5 | 27603 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 573048 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 120264 | |
| 3 | 93801 | |
| 9 | 66033 | |
| 6 | 64887 | |
| 4 | 53476 | |
| 2 | 51228 | |
| 7 | 38365 | 6.7% |
| 0 | 29712 | 5.2% |
| 8 | 27679 | 4.8% |
| 5 | 27603 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 573048 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 120264 | |
| 3 | 93801 | |
| 9 | 66033 | |
| 6 | 64887 | |
| 4 | 53476 | |
| 2 | 51228 | |
| 7 | 38365 | 6.7% |
| 0 | 29712 | 5.2% |
| 8 | 27679 | 4.8% |
| 5 | 27603 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 573048 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 120264 | |
| 3 | 93801 | |
| 9 | 66033 | |
| 6 | 64887 | |
| 4 | 53476 | |
| 2 | 51228 | |
| 7 | 38365 | 6.7% |
| 0 | 29712 | 5.2% |
| 8 | 27679 | 4.8% |
| 5 | 27603 | 4.8% |
familyKey
Text
Missing 
| Distinct | 815 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 28710 |
| Missing (%) | 15.4% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.19253702 |
| Min length | 4 |
Unique
| Unique | 100 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 5353 |
|---|---|
| 2nd row | 2503 |
| 3rd row | 7708 |
| 4th row | 6134 |
| 5th row | 4194986 |
| Value | Count | Frequency (%) |
| 7708 | 13776 | 8.7% |
| 3065 | 7289 | 4.6% |
| 3073 | 6277 | 4.0% |
| 5386 | 4763 | 3.0% |
| 7689 | 3466 | 2.2% |
| 2373 | 3452 | 2.2% |
| 5015 | 3290 | 2.1% |
| 2367 | 3008 | 1.9% |
| 4673 | 2360 | 1.5% |
| 5353 | 2219 | 1.4% |
| Other values (805) | 107919 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 97237 | |
| 7 | 89007 | |
| 2 | 83824 | |
| 3 | 80434 | |
| 8 | 66968 | |
| 0 | 57302 | |
| 5 | 50492 | |
| 4 | 50488 | |
| 1 | 47343 | |
| 9 | 38567 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 661662 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 97237 | |
| 7 | 89007 | |
| 2 | 83824 | |
| 3 | 80434 | |
| 8 | 66968 | |
| 0 | 57302 | |
| 5 | 50492 | |
| 4 | 50488 | |
| 1 | 47343 | |
| 9 | 38567 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 661662 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 97237 | |
| 7 | 89007 | |
| 2 | 83824 | |
| 3 | 80434 | |
| 8 | 66968 | |
| 0 | 57302 | |
| 5 | 50492 | |
| 4 | 50488 | |
| 1 | 47343 | |
| 9 | 38567 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 661662 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 97237 | |
| 7 | 89007 | |
| 2 | 83824 | |
| 3 | 80434 | |
| 8 | 66968 | |
| 0 | 57302 | |
| 5 | 50492 | |
| 4 | 50488 | |
| 1 | 47343 | |
| 9 | 38567 | 5.8% |
genusKey
Text
Missing 
| Distinct | 4124 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 28788 |
| Missing (%) | 15.4% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.016438339 |
| Min length | 7 |
Unique
| Unique | 1062 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 2700604 |
|---|---|
| 2nd row | 3170037 |
| 3rd row | 2721893 |
| 4th row | 4276909 |
| 5th row | 6008574 |
| Value | Count | Frequency (%) |
| 2721893 | 8831 | 5.6% |
| 2668958 | 2360 | 1.5% |
| 2651126 | 2269 | 1.4% |
| 2701072 | 1784 | 1.1% |
| 2688736 | 1708 | 1.1% |
| 2650583 | 1705 | 1.1% |
| 2689215 | 1518 | 1.0% |
| 3196548 | 1505 | 1.0% |
| 2650111 | 1375 | 0.9% |
| 2874237 | 1213 | 0.8% |
| Other values (4114) | 133473 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 187980 | |
| 8 | 133207 | |
| 6 | 127549 | |
| 3 | 111156 | |
| 7 | 108846 | |
| 1 | 100782 | |
| 9 | 99825 | |
| 5 | 90092 | |
| 0 | 89046 | |
| 4 | 58297 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1106780 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 187980 | |
| 8 | 133207 | |
| 6 | 127549 | |
| 3 | 111156 | |
| 7 | 108846 | |
| 1 | 100782 | |
| 9 | 99825 | |
| 5 | 90092 | |
| 0 | 89046 | |
| 4 | 58297 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1106780 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 187980 | |
| 8 | 133207 | |
| 6 | 127549 | |
| 3 | 111156 | |
| 7 | 108846 | |
| 1 | 100782 | |
| 9 | 99825 | |
| 5 | 90092 | |
| 0 | 89046 | |
| 4 | 58297 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1106780 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 187980 | |
| 8 | 133207 | |
| 6 | 127549 | |
| 3 | 111156 | |
| 7 | 108846 | |
| 1 | 100782 | |
| 9 | 99825 | |
| 5 | 90092 | |
| 0 | 89046 | |
| 4 | 58297 | 5.3% |
speciesKey
Text
Missing 
| Distinct | 11415 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 54335 |
| Missing (%) | 29.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.020250541 |
| Min length | 7 |
Unique
| Unique | 4601 ? |
|---|---|
| Unique (%) | 3.5% |
Sample
| 1st row | 2700991 |
|---|---|
| 2nd row | 3170096 |
| 3rd row | 2728060 |
| 4th row | 4276910 |
| 5th row | 6070732 |
| Value | Count | Frequency (%) |
| 2689220 | 495 | 0.4% |
| 4276980 | 477 | 0.4% |
| 2689218 | 392 | 0.3% |
| 4276912 | 379 | 0.3% |
| 2689212 | 359 | 0.3% |
| 5710205 | 342 | 0.3% |
| 2689327 | 337 | 0.3% |
| 2688707 | 333 | 0.3% |
| 2688970 | 318 | 0.2% |
| 5286325 | 288 | 0.2% |
| Other values (11405) | 128474 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 149599 | |
| 7 | 99507 | |
| 8 | 97422 | |
| 3 | 95610 | |
| 5 | 89780 | |
| 6 | 89662 | |
| 1 | 86682 | |
| 0 | 79003 | |
| 9 | 76358 | |
| 4 | 64412 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 928035 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 149599 | |
| 7 | 99507 | |
| 8 | 97422 | |
| 3 | 95610 | |
| 5 | 89780 | |
| 6 | 89662 | |
| 1 | 86682 | |
| 0 | 79003 | |
| 9 | 76358 | |
| 4 | 64412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 928035 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 149599 | |
| 7 | 99507 | |
| 8 | 97422 | |
| 3 | 95610 | |
| 5 | 89780 | |
| 6 | 89662 | |
| 1 | 86682 | |
| 0 | 79003 | |
| 9 | 76358 | |
| 4 | 64412 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 928035 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 149599 | |
| 7 | 99507 | |
| 8 | 97422 | |
| 3 | 95610 | |
| 5 | 89780 | |
| 6 | 89662 | |
| 1 | 86682 | |
| 0 | 79003 | |
| 9 | 76358 | |
| 4 | 64412 |
species
Text
Missing 
| Distinct | 11401 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 54335 |
| Missing (%) | 29.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 30 |
| Mean length | 18.93441457 |
| Min length | 8 |
Unique
| Unique | 4591 ? |
|---|---|
| Unique (%) | 3.5% |
Sample
| 1st row | Luzula bulbosa |
|---|---|
| 2nd row | Gentiana clausa |
| 3rd row | Carex vulpinoidea |
| 4th row | Lophocolea minor |
| 5th row | Mimulus ringens |
| Value | Count | Frequency (%) |
| carex | 7436 | 2.8% |
| sphagnum | 2358 | 0.9% |
| frullania | 1702 | 0.6% |
| canadensis | 1589 | 0.6% |
| scapania | 1515 | 0.6% |
| juncus | 1326 | 0.5% |
| viola | 1196 | 0.5% |
| viburnum | 1126 | 0.4% |
| dichanthelium | 996 | 0.4% |
| cyperus | 990 | 0.4% |
| Other values (9729) | 244228 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 299823 | 12.0% |
| i | 240977 | 9.6% |
| e | 162367 | 6.5% |
| l | 155318 | 6.2% |
| r | 153244 | 6.1% |
| u | 145277 | 5.8% |
| o | 142900 | 5.7% |
| s | 139597 | 5.6% |
| 132268 | 5.3% | |
| n | 131280 | 5.2% |
| Other values (44) | 799965 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2237753 | |
| Space Separator | 132268 | 5.3% |
| Uppercase Letter | 132194 | 5.3% |
| Dash Punctuation | 801 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 299823 | |
| i | 240977 | |
| e | 162367 | 7.3% |
| l | 155318 | 6.9% |
| r | 153244 | 6.8% |
| u | 145277 | 6.5% |
| o | 142900 | 6.4% |
| s | 139597 | 6.2% |
| n | 131280 | 5.9% |
| t | 108642 | 4.9% |
| Other values (16) | 558328 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 21176 | |
| S | 16784 | |
| P | 16487 | |
| A | 9762 | 7.4% |
| L | 7739 | 5.9% |
| D | 6545 | 5.0% |
| R | 6344 | 4.8% |
| M | 5621 | 4.3% |
| E | 5440 | 4.1% |
| B | 5064 | 3.8% |
| Other values (16) | 31232 |
Space Separator
| Value | Count | Frequency (%) |
| 132268 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 801 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2369947 | |
| Common | 133069 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 299823 | |
| i | 240977 | 10.2% |
| e | 162367 | 6.9% |
| l | 155318 | 6.6% |
| r | 153244 | 6.5% |
| u | 145277 | 6.1% |
| o | 142900 | 6.0% |
| s | 139597 | 5.9% |
| n | 131280 | 5.5% |
| t | 108642 | 4.6% |
| Other values (42) | 690522 |
Common
| Value | Count | Frequency (%) |
| 132268 | ||
| - | 801 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2503016 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 299823 | 12.0% |
| i | 240977 | 9.6% |
| e | 162367 | 6.5% |
| l | 155318 | 6.2% |
| r | 153244 | 6.1% |
| u | 145277 | 5.8% |
| o | 142900 | 5.7% |
| s | 139597 | 5.6% |
| 132268 | 5.3% | |
| n | 131280 | 5.2% |
| Other values (44) | 799965 |
| Distinct | 13241 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 122 |
|---|---|
| Median length | 81 |
| Mean length | 25.88063439 |
| Min length | 5 |
Unique
| Unique | 5383 ? |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | Luzula bulbosa (Alph.Wood) Smyth & L.C.R.Smyth |
|---|---|
| 2nd row | Gentiana clausa Raf. |
| 3rd row | Carex vulpinoidea Michx. |
| 4th row | Lophocolea minor Nees |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| l | 53323 | 8.7% |
| plantae | 28377 | 4.7% |
| 14536 | 2.4% | |
| ex | 10569 | 1.7% |
| carex | 8831 | 1.4% |
| hedw | 6411 | 1.1% |
| willd | 5097 | 0.8% |
| dumort | 4794 | 0.8% |
| michx | 4667 | 0.8% |
| subsp | 3140 | 0.5% |
| Other values (13765) | 470434 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 449227 | 9.3% |
| 423668 | 8.8% | |
| i | 320087 | 6.6% |
| e | 303237 | 6.3% |
| l | 260082 | 5.4% |
| r | 247172 | 5.1% |
| n | 228432 | 4.7% |
| . | 219057 | 4.5% |
| o | 211700 | 4.4% |
| s | 202864 | 4.2% |
| Other values (100) | 1961497 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3519415 | |
| Uppercase Letter | 491763 | 10.2% |
| Space Separator | 423668 | 8.8% |
| Other Punctuation | 241072 | 5.0% |
| Open Punctuation | 66474 | 1.4% |
| Close Punctuation | 66474 | 1.4% |
| Decimal Number | 15952 | 0.3% |
| Dash Punctuation | 1598 | < 0.1% |
| Math Symbol | 598 | < 0.1% |
| Connector Punctuation | 9 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 449227 | |
| i | 320087 | 9.1% |
| e | 303237 | 8.6% |
| l | 260082 | 7.4% |
| r | 247172 | 7.0% |
| n | 228432 | 6.5% |
| o | 211700 | 6.0% |
| s | 202864 | 5.8% |
| t | 199790 | 5.7% |
| u | 197691 | 5.6% |
| Other values (48) | 899133 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 78990 | |
| P | 58272 | |
| S | 47010 | 9.6% |
| C | 37377 | 7.6% |
| A | 31917 | 6.5% |
| M | 26946 | 5.5% |
| H | 26372 | 5.4% |
| B | 22862 | 4.6% |
| D | 22849 | 4.6% |
| R | 19557 | 4.0% |
| Other values (22) | 119611 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4467 | |
| 8 | 3584 | |
| 2 | 2003 | |
| 0 | 1837 | |
| 9 | 1077 | 6.8% |
| 3 | 852 | 5.3% |
| 4 | 733 | 4.6% |
| 7 | 540 | 3.4% |
| 5 | 447 | 2.8% |
| 6 | 412 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 219057 | |
| & | 14536 | 6.0% |
| , | 7310 | 3.0% |
| ' | 169 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 423668 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 66474 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 66474 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1598 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 598 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 9 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4011178 | |
| Common | 815845 | 16.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 449227 | 11.2% |
| i | 320087 | 8.0% |
| e | 303237 | 7.6% |
| l | 260082 | 6.5% |
| r | 247172 | 6.2% |
| n | 228432 | 5.7% |
| o | 211700 | 5.3% |
| s | 202864 | 5.1% |
| t | 199790 | 5.0% |
| u | 197691 | 4.9% |
| Other values (80) | 1390896 |
Common
| Value | Count | Frequency (%) |
| 423668 | ||
| . | 219057 | |
| ( | 66474 | 8.1% |
| ) | 66474 | 8.1% |
| & | 14536 | 1.8% |
| , | 7310 | 0.9% |
| 1 | 4467 | 0.5% |
| 8 | 3584 | 0.4% |
| 2 | 2003 | 0.2% |
| 0 | 1837 | 0.2% |
| Other values (10) | 6435 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4815169 | |
| None | 11854 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 449227 | 9.3% |
| 423668 | 8.8% | |
| i | 320087 | 6.6% |
| e | 303237 | 6.3% |
| l | 260082 | 5.4% |
| r | 247172 | 5.1% |
| n | 228432 | 4.7% |
| . | 219057 | 4.5% |
| o | 211700 | 4.4% |
| s | 202864 | 4.2% |
| Other values (61) | 1949643 |
None
| Value | Count | Frequency (%) |
| ü | 2478 | |
| ö | 2362 | |
| á | 1752 | |
| ň | 1314 | |
| ä | 966 | 8.1% |
| é | 722 | 6.1% |
| × | 598 | 5.0% |
| Á | 349 | 2.9% |
| ø | 272 | 2.3% |
| Å | 263 | 2.2% |
| Other values (29) | 778 | 6.6% |
| Distinct | 16379 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 50 |
|---|---|
| Median length | 43 |
| Mean length | 15.95681637 |
| Min length | 3 |
Unique
| Unique | 7549 ? |
|---|---|
| Unique (%) | 4.0% |
Sample
| 1st row | Luzula bulbosa |
|---|---|
| 2nd row | Gentiana clausa |
| 3rd row | Carex muhlenbergii |
| 4th row | Lophocolea minor |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 28374 | 8.6% |
| carex | 8803 | 2.7% |
| var | 3699 | 1.1% |
| dryopteris | 2392 | 0.7% |
| sphagnum | 2360 | 0.7% |
| juncus | 1814 | 0.5% |
| frullania | 1708 | 0.5% |
| asplenium | 1557 | 0.5% |
| scapania | 1517 | 0.5% |
| canadensis | 1515 | 0.5% |
| Other values (11105) | 276305 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 389969 | |
| i | 262458 | 8.8% |
| e | 205819 | 6.9% |
| l | 196972 | 6.6% |
| r | 175234 | 5.9% |
| n | 170772 | 5.7% |
| u | 162576 | 5.5% |
| o | 156926 | 5.3% |
| s | 154857 | 5.2% |
| t | 146008 | 4.9% |
| Other values (48) | 954818 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2641527 | |
| Uppercase Letter | 186514 | 6.3% |
| Space Separator | 143515 | 4.8% |
| Other Punctuation | 4146 | 0.1% |
| Dash Punctuation | 705 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 389969 | |
| i | 262458 | |
| e | 205819 | 7.8% |
| l | 196972 | 7.5% |
| r | 175234 | 6.6% |
| n | 170772 | 6.5% |
| u | 162576 | 6.2% |
| o | 156926 | 5.9% |
| s | 154857 | 5.9% |
| t | 146008 | 5.5% |
| Other values (16) | 619936 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 49351 | |
| C | 26040 | |
| S | 16978 | 9.1% |
| A | 13951 | 7.5% |
| L | 10862 | 5.8% |
| D | 7781 | 4.2% |
| R | 6989 | 3.7% |
| E | 6742 | 3.6% |
| B | 6396 | 3.4% |
| M | 6026 | 3.2% |
| Other values (16) | 35398 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4144 | |
| ? | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 143515 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 705 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2828041 | |
| Common | 148368 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 389969 | |
| i | 262458 | 9.3% |
| e | 205819 | 7.3% |
| l | 196972 | 7.0% |
| r | 175234 | 6.2% |
| n | 170772 | 6.0% |
| u | 162576 | 5.7% |
| o | 156926 | 5.5% |
| s | 154857 | 5.5% |
| t | 146008 | 5.2% |
| Other values (42) | 806450 |
Common
| Value | Count | Frequency (%) |
| 143515 | ||
| . | 4144 | 2.8% |
| - | 705 | 0.5% |
| ? | 2 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2976409 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 389969 | |
| i | 262458 | 8.8% |
| e | 205819 | 6.9% |
| l | 196972 | 6.6% |
| r | 175234 | 5.9% |
| n | 170772 | 5.7% |
| u | 162576 | 5.5% |
| o | 156926 | 5.3% |
| s | 154857 | 5.2% |
| t | 146008 | 4.9% |
| Other values (48) | 954818 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 186529 | |
| M | 186529 | |
| L | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 559587 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 186529 | |
| M | 186529 | |
| L | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 559587 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 186529 | |
| M | 186529 | |
| L | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 559587 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 186529 | |
| M | 186529 | |
| L | 186529 |
lastParsed
Text
| Distinct | 18062 |
|---|---|
| Distinct (%) | 9.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99661179 |
| Min length | 20 |
Unique
| Unique | 1510 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 2025-01-07T13:09:13.317Z |
|---|---|
| 2nd row | 2025-01-07T13:09:13.317Z |
| 3rd row | 2025-01-07T13:09:13.317Z |
| 4th row | 2025-01-07T13:09:13.317Z |
| 5th row | 2025-01-07T13:09:13.318Z |
| Value | Count | Frequency (%) |
| 2025-01-07t13:09:09.691z | 61 | < 0.1% |
| 2025-01-07t13:09:14.842z | 55 | < 0.1% |
| 2025-01-07t13:09:13.153z | 54 | < 0.1% |
| 2025-01-07t13:09:14.841z | 53 | < 0.1% |
| 2025-01-07t13:09:14.268z | 52 | < 0.1% |
| 2025-01-07t13:09:10.471z | 51 | < 0.1% |
| 2025-01-07t13:09:13.231z | 51 | < 0.1% |
| 2025-01-07t13:09:13.898z | 51 | < 0.1% |
| 2025-01-07t13:09:12.942z | 51 | < 0.1% |
| 2025-01-07t13:09:14.350z | 50 | < 0.1% |
| Other values (18052) | 186000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 902065 | |
| 1 | 544038 | |
| 2 | 450890 | |
| - | 373058 | |
| : | 373058 | |
| 3 | 270263 | 6.0% |
| 5 | 267401 | 6.0% |
| 7 | 257877 | 5.8% |
| 9 | 254149 | 5.7% |
| T | 186529 | 4.2% |
| Other values (5) | 596736 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3170519 | |
| Other Punctuation | 559429 | 12.5% |
| Dash Punctuation | 373058 | 8.3% |
| Uppercase Letter | 373058 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 902065 | |
| 1 | 544038 | |
| 2 | 450890 | |
| 3 | 270263 | 8.5% |
| 5 | 267401 | 8.4% |
| 7 | 257877 | 8.1% |
| 9 | 254149 | 8.0% |
| 4 | 80989 | 2.6% |
| 8 | 73891 | 2.3% |
| 6 | 68956 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 373058 | |
| . | 186371 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 186529 | |
| Z | 186529 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 373058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4103006 | |
| Latin | 373058 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 902065 | |
| 1 | 544038 | |
| 2 | 450890 | |
| - | 373058 | |
| : | 373058 | |
| 3 | 270263 | 6.6% |
| 5 | 267401 | 6.5% |
| 7 | 257877 | 6.3% |
| 9 | 254149 | 6.2% |
| . | 186371 | 4.5% |
| Other values (3) | 223836 | 5.5% |
Latin
| Value | Count | Frequency (%) |
| T | 186529 | |
| Z | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4476064 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 902065 | |
| 1 | 544038 | |
| 2 | 450890 | |
| - | 373058 | |
| : | 373058 | |
| 3 | 270263 | 6.0% |
| 5 | 267401 | 6.0% |
| 7 | 257877 | 5.8% |
| 9 | 254149 | 5.7% |
| T | 186529 | 4.2% |
| Other values (5) | 596736 |
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2025-01-07T13:01:58.967Z |
|---|---|
| 2nd row | 2025-01-07T13:01:58.967Z |
| 3rd row | 2025-01-07T13:01:58.967Z |
| 4th row | 2025-01-07T13:01:58.967Z |
| 5th row | 2025-01-07T13:01:58.967Z |
| Value | Count | Frequency (%) |
| 2025-01-07t13:01:58.967z | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 746116 | |
| 1 | 559587 | |
| 2 | 373058 | |
| 5 | 373058 | |
| - | 373058 | |
| 7 | 373058 | |
| : | 373058 | |
| T | 186529 | 4.2% |
| 3 | 186529 | 4.2% |
| 8 | 186529 | 4.2% |
| Other values (4) | 746116 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3170993 | |
| Other Punctuation | 559587 | 12.5% |
| Dash Punctuation | 373058 | 8.3% |
| Uppercase Letter | 373058 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 746116 | |
| 1 | 559587 | |
| 2 | 373058 | |
| 5 | 373058 | |
| 7 | 373058 | |
| 3 | 186529 | 5.9% |
| 8 | 186529 | 5.9% |
| 9 | 186529 | 5.9% |
| 6 | 186529 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 373058 | |
| . | 186529 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 186529 | |
| Z | 186529 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 373058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4103638 | |
| Latin | 373058 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 746116 | |
| 1 | 559587 | |
| 2 | 373058 | |
| 5 | 373058 | |
| - | 373058 | |
| 7 | 373058 | |
| : | 373058 | |
| 3 | 186529 | 4.5% |
| 8 | 186529 | 4.5% |
| . | 186529 | 4.5% |
| Other values (2) | 373058 |
Latin
| Value | Count | Frequency (%) |
| T | 186529 | |
| Z | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4476696 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 746116 | |
| 1 | 559587 | |
| 2 | 373058 | |
| 5 | 373058 | |
| - | 373058 | |
| 7 | 373058 | |
| : | 373058 | |
| T | 186529 | 4.2% |
| 3 | 186529 | 4.2% |
| 8 | 186529 | 4.2% |
| Other values (4) | 746116 |
repatriated
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 72482 |
| Missing (%) | 38.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.860750392 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | true |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 98166 | |
| true | 15881 | 13.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 114047 | |
| f | 98166 | |
| a | 98166 | |
| l | 98166 | |
| s | 98166 | |
| t | 15881 | 2.9% |
| r | 15881 | 2.9% |
| u | 15881 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 554354 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 114047 | |
| f | 98166 | |
| a | 98166 | |
| l | 98166 | |
| s | 98166 | |
| t | 15881 | 2.9% |
| r | 15881 | 2.9% |
| u | 15881 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 554354 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 114047 | |
| f | 98166 | |
| a | 98166 | |
| l | 98166 | |
| s | 98166 | |
| t | 15881 | 2.9% |
| r | 15881 | 2.9% |
| u | 15881 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 554354 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 114047 | |
| f | 98166 | |
| a | 98166 | |
| l | 98166 | |
| s | 98166 | |
| t | 15881 | 2.9% |
| r | 15881 | 2.9% |
| u | 15881 | 2.9% |
isSequenced
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 186529 | |
| a | 186529 | |
| l | 186529 | |
| s | 186529 | |
| e | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 932645 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 186529 | |
| a | 186529 | |
| l | 186529 | |
| s | 186529 | |
| e | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 932645 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 186529 | |
| a | 186529 | |
| l | 186529 | |
| s | 186529 | |
| e | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 932645 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 186529 | |
| a | 186529 | |
| l | 186529 | |
| s | 186529 | |
| e | 186529 |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 72484 |
| Missing (%) | 38.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.75279933 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 104594 | |
| latin_america | 5603 | 4.9% |
| europe | 1857 | 1.6% |
| asia | 998 | 0.9% |
| oceania | 680 | 0.6% |
| africa | 298 | 0.3% |
| antarctica | 15 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 229994 | |
| R | 216961 | |
| I | 117791 | |
| E | 114591 | |
| C | 111205 | |
| N | 110892 | |
| T | 110227 | |
| _ | 110197 | |
| M | 110197 | |
| O | 107131 | |
| Other values (6) | 115207 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1344196 | |
| Connector Punctuation | 110197 | 7.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 229994 | |
| R | 216961 | |
| I | 117791 | |
| E | 114591 | |
| C | 111205 | |
| N | 110892 | |
| T | 110227 | |
| M | 110197 | |
| O | 107131 | |
| H | 104594 | |
| Other values (5) | 10613 | 0.8% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 110197 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1344196 | |
| Common | 110197 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 229994 | |
| R | 216961 | |
| I | 117791 | |
| E | 114591 | |
| C | 111205 | |
| N | 110892 | |
| T | 110227 | |
| M | 110197 | |
| O | 107131 | |
| H | 104594 | |
| Other values (5) | 10613 | 0.8% |
Common
| Value | Count | Frequency (%) |
| _ | 110197 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1454393 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 229994 | |
| R | 216961 | |
| I | 117791 | |
| E | 114591 | |
| C | 111205 | |
| N | 110892 | |
| T | 110227 | |
| _ | 110197 | |
| M | 110197 | |
| O | 107131 | |
| Other values (6) | 115207 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 186529 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 373058 | |
| A | 373058 | |
| N | 186529 | |
| O | 186529 | |
| T | 186529 | |
| H | 186529 | |
| _ | 186529 | |
| M | 186529 | |
| E | 186529 | |
| I | 186529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2238348 | |
| Connector Punctuation | 186529 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 373058 | |
| A | 373058 | |
| N | 186529 | |
| O | 186529 | |
| T | 186529 | |
| H | 186529 | |
| M | 186529 | |
| E | 186529 | |
| I | 186529 | |
| C | 186529 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 186529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2238348 | |
| Common | 186529 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 373058 | |
| A | 373058 | |
| N | 186529 | |
| O | 186529 | |
| T | 186529 | |
| H | 186529 | |
| M | 186529 | |
| E | 186529 | |
| I | 186529 | |
| C | 186529 |
Common
| Value | Count | Frequency (%) |
| _ | 186529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2424877 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 373058 | |
| A | 373058 | |
| N | 186529 | |
| O | 186529 | |
| T | 186529 | |
| H | 186529 | |
| _ | 186529 | |
| M | 186529 | |
| E | 186529 | |
| I | 186529 |
level0Gid
Text
Missing 
| Distinct | 77 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 86228 |
| Missing (%) | 46.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | USA |
| 3rd row | CAN |
| 4th row | USA |
| 5th row | USA |
| Value | Count | Frequency (%) |
| usa | 89645 | |
| can | 5409 | 5.4% |
| mex | 903 | 0.9% |
| pri | 834 | 0.8% |
| chn | 726 | 0.7% |
| gbr | 460 | 0.5% |
| bmu | 334 | 0.3% |
| fra | 241 | 0.2% |
| ecu | 222 | 0.2% |
| bhs | 181 | 0.2% |
| Other values (67) | 1346 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 95752 | |
| U | 90542 | |
| S | 90036 | |
| C | 6542 | 2.2% |
| N | 6422 | 2.1% |
| R | 1743 | 0.6% |
| M | 1438 | 0.5% |
| E | 1293 | 0.4% |
| B | 1135 | 0.4% |
| P | 1073 | 0.4% |
| Other values (16) | 4927 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 300903 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 95752 | |
| U | 90542 | |
| S | 90036 | |
| C | 6542 | 2.2% |
| N | 6422 | 2.1% |
| R | 1743 | 0.6% |
| M | 1438 | 0.5% |
| E | 1293 | 0.4% |
| B | 1135 | 0.4% |
| P | 1073 | 0.4% |
| Other values (16) | 4927 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 300903 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 95752 | |
| U | 90542 | |
| S | 90036 | |
| C | 6542 | 2.2% |
| N | 6422 | 2.1% |
| R | 1743 | 0.6% |
| M | 1438 | 0.5% |
| E | 1293 | 0.4% |
| B | 1135 | 0.4% |
| P | 1073 | 0.4% |
| Other values (16) | 4927 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 300903 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 95752 | |
| U | 90542 | |
| S | 90036 | |
| C | 6542 | 2.2% |
| N | 6422 | 2.1% |
| R | 1743 | 0.6% |
| M | 1438 | 0.5% |
| E | 1293 | 0.4% |
| B | 1135 | 0.4% |
| P | 1073 | 0.4% |
| Other values (16) | 4927 | 1.6% |
level0Name
Text
Missing 
| Distinct | 77 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 86228 |
| Missing (%) | 46.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 13 |
| Mean length | 12.36356567 |
| Min length | 4 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | United States |
| 3rd row | Canada |
| 4th row | United States |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 90105 | |
| states | 89645 | |
| canada | 5409 | 2.8% |
| méxico | 903 | 0.5% |
| puerto | 834 | 0.4% |
| rico | 834 | 0.4% |
| china | 726 | 0.4% |
| kingdom | 460 | 0.2% |
| bermuda | 334 | 0.2% |
| france | 241 | 0.1% |
| Other values (87) | 1995 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 270627 | |
| e | 181844 | |
| a | 109780 | |
| n | 97629 | 7.9% |
| d | 96873 | 7.8% |
| i | 93968 | 7.6% |
| 91185 | 7.4% | |
| s | 90321 | 7.3% |
| U | 90116 | 7.3% |
| S | 89755 | 7.2% |
| Other values (42) | 27980 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 957389 | |
| Uppercase Letter | 191471 | 15.4% |
| Space Separator | 91185 | 7.4% |
| Other Punctuation | 33 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 270627 | |
| e | 181844 | |
| a | 109780 | |
| n | 97629 | 10.2% |
| d | 96873 | 10.1% |
| i | 93968 | 9.8% |
| s | 90321 | 9.4% |
| o | 3638 | 0.4% |
| c | 2406 | 0.3% |
| r | 2348 | 0.2% |
| Other values (17) | 7955 | 0.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 90116 | |
| S | 89755 | |
| C | 6290 | 3.3% |
| M | 992 | 0.5% |
| P | 970 | 0.5% |
| R | 861 | 0.4% |
| B | 595 | 0.3% |
| K | 475 | 0.2% |
| F | 322 | 0.2% |
| A | 263 | 0.1% |
| Other values (12) | 832 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 22 | |
| , | 11 |
Space Separator
| Value | Count | Frequency (%) |
| 91185 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1148860 | |
| Common | 91218 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 270627 | |
| e | 181844 | |
| a | 109780 | |
| n | 97629 | 8.5% |
| d | 96873 | 8.4% |
| i | 93968 | 8.2% |
| s | 90321 | 7.9% |
| U | 90116 | 7.8% |
| S | 89755 | 7.8% |
| C | 6290 | 0.5% |
| Other values (39) | 21657 | 1.9% |
Common
| Value | Count | Frequency (%) |
| 91185 | ||
| . | 22 | < 0.1% |
| , | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1239169 | |
| None | 909 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 270627 | |
| e | 181844 | |
| a | 109780 | |
| n | 97629 | 7.9% |
| d | 96873 | 7.8% |
| i | 93968 | 7.6% |
| 91185 | 7.4% | |
| s | 90321 | 7.3% |
| U | 90116 | 7.3% |
| S | 89755 | 7.2% |
| Other values (40) | 27071 | 2.2% |
None
| Value | Count | Frequency (%) |
| é | 903 | |
| Å | 6 | 0.7% |
level1Gid
Text
Missing 
| Distinct | 382 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 86228 |
| Missing (%) | 46.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.281044057 |
| Min length | 7 |
Unique
| Unique | 71 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | USA.7_1 |
|---|---|
| 2nd row | USA.7_1 |
| 3rd row | CAN.2_1 |
| 4th row | USA.7_1 |
| 5th row | USA.7_1 |
| Value | Count | Frequency (%) |
| usa.7_1 | 61491 | |
| usa.23_1 | 2868 | 2.9% |
| usa.30_1 | 2549 | 2.5% |
| usa.5_1 | 2515 | 2.5% |
| usa.10_1 | 2189 | 2.2% |
| usa.22_1 | 1940 | 1.9% |
| usa.20_1 | 1921 | 1.9% |
| can.2_1 | 1690 | 1.7% |
| usa.33_1 | 1296 | 1.3% |
| usa.48_1 | 1264 | 1.3% |
| Other values (372) | 20578 | 20.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 111747 | |
| . | 100301 | |
| _ | 100293 | |
| A | 95752 | |
| U | 90542 | |
| S | 90036 | |
| 7 | 63831 | |
| 2 | 14786 | 2.0% |
| 3 | 12782 | 1.8% |
| 0 | 7852 | 1.1% |
| Other values (28) | 42374 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 300927 | |
| Decimal Number | 228775 | |
| Other Punctuation | 100301 | 13.7% |
| Connector Punctuation | 100293 | 13.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 95752 | |
| U | 90542 | |
| S | 90036 | |
| C | 6542 | 2.2% |
| N | 6422 | 2.1% |
| R | 1743 | 0.6% |
| M | 1438 | 0.5% |
| E | 1293 | 0.4% |
| B | 1135 | 0.4% |
| P | 1073 | 0.4% |
| Other values (16) | 4951 | 1.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 111747 | |
| 7 | 63831 | |
| 2 | 14786 | 6.5% |
| 3 | 12782 | 5.6% |
| 0 | 7852 | 3.4% |
| 4 | 6773 | 3.0% |
| 5 | 3908 | 1.7% |
| 6 | 2536 | 1.1% |
| 8 | 2307 | 1.0% |
| 9 | 2253 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 100301 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 100293 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 429369 | |
| Latin | 300927 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 95752 | |
| U | 90542 | |
| S | 90036 | |
| C | 6542 | 2.2% |
| N | 6422 | 2.1% |
| R | 1743 | 0.6% |
| M | 1438 | 0.5% |
| E | 1293 | 0.4% |
| B | 1135 | 0.4% |
| P | 1073 | 0.4% |
| Other values (16) | 4951 | 1.6% |
Common
| Value | Count | Frequency (%) |
| 1 | 111747 | |
| . | 100301 | |
| _ | 100293 | |
| 7 | 63831 | |
| 2 | 14786 | 3.4% |
| 3 | 12782 | 3.0% |
| 0 | 7852 | 1.8% |
| 4 | 6773 | 1.6% |
| 5 | 3908 | 0.9% |
| 6 | 2536 | 0.6% |
| Other values (2) | 4560 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 730296 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 111747 | |
| . | 100301 | |
| _ | 100293 | |
| A | 95752 | |
| U | 90542 | |
| S | 90036 | |
| 7 | 63831 | |
| 2 | 14786 | 2.0% |
| 3 | 12782 | 1.8% |
| 0 | 7852 | 1.1% |
| Other values (28) | 42374 | 5.8% |
level1Name
Text
Missing 
| Distinct | 380 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 86228 |
| Missing (%) | 46.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 11 |
| Mean length | 10.37484173 |
| Min length | 3 |
Unique
| Unique | 71 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Connecticut |
|---|---|
| 2nd row | Connecticut |
| 3rd row | British Columbia |
| 4th row | Connecticut |
| 5th row | Connecticut |
| Value | Count | Frequency (%) |
| connecticut | 61491 | |
| new | 4618 | 4.1% |
| michigan | 2868 | 2.6% |
| hampshire | 2549 | 2.3% |
| california | 2536 | 2.3% |
| florida | 2189 | 2.0% |
| massachusetts | 1940 | 1.7% |
| maine | 1921 | 1.7% |
| columbia | 1815 | 1.6% |
| british | 1690 | 1.5% |
| Other values (428) | 28413 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 148113 | |
| t | 137257 | |
| c | 132193 | |
| i | 98938 | |
| o | 86362 | |
| e | 83832 | |
| u | 69788 | |
| C | 67628 | |
| a | 45481 | 4.4% |
| r | 20585 | 2.0% |
| Other values (64) | 150430 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 916653 | |
| Uppercase Letter | 111765 | 10.7% |
| Space Separator | 11729 | 1.1% |
| Dash Punctuation | 329 | < 0.1% |
| Other Punctuation | 131 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 148113 | |
| t | 137257 | |
| c | 132193 | |
| i | 98938 | |
| o | 86362 | |
| e | 83832 | |
| u | 69788 | |
| a | 45481 | 5.0% |
| r | 20585 | 2.2% |
| s | 20401 | 2.2% |
| Other values (33) | 73703 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 67628 | |
| M | 8798 | 7.9% |
| N | 7769 | 7.0% |
| H | 4127 | 3.7% |
| F | 2321 | 2.1% |
| S | 2246 | 2.0% |
| B | 2016 | 1.8% |
| W | 2003 | 1.8% |
| A | 1861 | 1.7% |
| V | 1821 | 1.6% |
| Other values (17) | 11175 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 127 | |
| . | 4 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| 11729 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 329 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1028418 | |
| Common | 12189 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 148113 | |
| t | 137257 | |
| c | 132193 | |
| i | 98938 | |
| o | 86362 | |
| e | 83832 | |
| u | 69788 | |
| C | 67628 | |
| a | 45481 | 4.4% |
| r | 20585 | 2.0% |
| Other values (60) | 138241 |
Common
| Value | Count | Frequency (%) |
| 11729 | ||
| - | 329 | 2.7% |
| ' | 127 | 1.0% |
| . | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1038649 | |
| None | 1957 | 0.2% |
| Latin Ext Additional | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 148113 | |
| t | 137257 | |
| c | 132193 | |
| i | 98938 | |
| o | 86362 | |
| e | 83832 | |
| u | 69788 | |
| C | 67628 | |
| a | 45481 | 4.4% |
| r | 20585 | 2.0% |
| Other values (46) | 148472 |
None
| Value | Count | Frequency (%) |
| é | 1146 | |
| í | 368 | 18.8% |
| á | 92 | 4.7% |
| ô | 78 | 4.0% |
| ö | 76 | 3.9% |
| ó | 62 | 3.2% |
| ü | 48 | 2.5% |
| ã | 24 | 1.2% |
| à | 16 | 0.8% |
| ś | 13 | 0.7% |
| Other values (7) | 34 | 1.7% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ộ | 1 |
level2Gid
Text
Missing 
| Distinct | 1864 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 87766 |
| Missing (%) | 47.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 9.519212661 |
| Min length | 7 |
Unique
| Unique | 550 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | USA.7.6_1 |
|---|---|
| 2nd row | USA.7.5_1 |
| 3rd row | CAN.2.8_1 |
| 4th row | USA.7.3_1 |
| 5th row | USA.7.2_1 |
| Value | Count | Frequency (%) |
| usa.7.5_1 | 21687 | |
| usa.7.2_1 | 10598 | 10.7% |
| usa.7.3_1 | 8951 | 9.1% |
| usa.7.1_1 | 6380 | 6.5% |
| usa.7.6_1 | 6092 | 6.2% |
| usa.7.4_1 | 4045 | 4.1% |
| usa.7.7_1 | 1936 | 2.0% |
| usa.7.8_1 | 1802 | 1.8% |
| usa.30.4_1 | 1119 | 1.1% |
| can.7.17_1 | 1051 | 1.1% |
| Other values (1854) | 35102 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 197518 | |
| 1 | 127750 | |
| _ | 98763 | |
| A | 95669 | |
| U | 90208 | |
| S | 89851 | |
| 7 | 70699 | 7.5% |
| 2 | 33642 | 3.6% |
| 5 | 32966 | 3.5% |
| 3 | 28520 | 3.0% |
| Other values (28) | 74560 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 347576 | |
| Uppercase Letter | 296289 | |
| Other Punctuation | 197518 | |
| Connector Punctuation | 98763 | 10.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 95669 | |
| U | 90208 | |
| S | 89851 | |
| C | 6509 | 2.2% |
| N | 6400 | 2.2% |
| E | 1270 | 0.4% |
| M | 1029 | 0.3% |
| X | 903 | 0.3% |
| R | 855 | 0.3% |
| H | 788 | 0.3% |
| Other values (16) | 2807 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 127750 | |
| 7 | 70699 | |
| 2 | 33642 | 9.7% |
| 5 | 32966 | 9.5% |
| 3 | 28520 | 8.2% |
| 4 | 17906 | 5.2% |
| 6 | 13016 | 3.7% |
| 0 | 10163 | 2.9% |
| 8 | 7593 | 2.2% |
| 9 | 5321 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 197518 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 98763 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 643857 | |
| Latin | 296289 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 95669 | |
| U | 90208 | |
| S | 89851 | |
| C | 6509 | 2.2% |
| N | 6400 | 2.2% |
| E | 1270 | 0.4% |
| M | 1029 | 0.3% |
| X | 903 | 0.3% |
| R | 855 | 0.3% |
| H | 788 | 0.3% |
| Other values (16) | 2807 | 0.9% |
Common
| Value | Count | Frequency (%) |
| . | 197518 | |
| 1 | 127750 | |
| _ | 98763 | |
| 7 | 70699 | 11.0% |
| 2 | 33642 | 5.2% |
| 5 | 32966 | 5.1% |
| 3 | 28520 | 4.4% |
| 4 | 17906 | 2.8% |
| 6 | 13016 | 2.0% |
| 0 | 10163 | 1.6% |
| Other values (2) | 12914 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 940146 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 197518 | |
| 1 | 127750 | |
| _ | 98763 | |
| A | 95669 | |
| U | 90208 | |
| S | 89851 | |
| 7 | 70699 | 7.5% |
| 2 | 33642 | 3.6% |
| 5 | 32966 | 3.5% |
| 3 | 28520 | 3.0% |
| Other values (28) | 74560 | 7.9% |
level2Name
Text
Missing 
| Distinct | 1522 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 87766 |
| Missing (%) | 47.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 30 |
| Mean length | 8.839798305 |
| Min length | 3 |
Unique
| Unique | 397 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | New London |
|---|---|
| 2nd row | New Haven |
| 3rd row | Columbia-Shuswap |
| 4th row | Litchfield |
| 5th row | Hartford |
| Value | Count | Frequency (%) |
| new | 27815 | |
| haven | 21687 | |
| hartford | 10598 | 7.9% |
| litchfield | 8951 | 6.6% |
| fairfield | 6391 | 4.7% |
| london | 6094 | 4.5% |
| middlesex | 4507 | 3.3% |
| windham | 2098 | 1.6% |
| tolland | 1936 | 1.4% |
| coos | 1128 | 0.8% |
| Other values (1636) | 43795 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 97796 | 11.2% |
| a | 78125 | 8.9% |
| n | 61952 | 7.1% |
| i | 58663 | 6.7% |
| o | 50830 | 5.8% |
| d | 50373 | 5.8% |
| r | 46106 | 5.3% |
| l | 36292 | 4.2% |
| 36237 | 4.2% | |
| t | 35119 | 4.0% |
| Other values (86) | 321552 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 695469 | |
| Uppercase Letter | 136523 | 15.6% |
| Space Separator | 36237 | 4.2% |
| Dash Punctuation | 3055 | 0.3% |
| Decimal Number | 962 | 0.1% |
| Other Punctuation | 761 | 0.1% |
| Close Punctuation | 19 | < 0.1% |
| Open Punctuation | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 97796 | |
| a | 78125 | |
| n | 61952 | |
| i | 58663 | 8.4% |
| o | 50830 | 7.3% |
| d | 50373 | 7.2% |
| r | 46106 | 6.6% |
| l | 36292 | 5.2% |
| t | 35119 | 5.0% |
| w | 29400 | 4.2% |
| Other values (39) | 150813 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 34544 | |
| N | 29570 | |
| L | 17533 | |
| M | 8839 | 6.5% |
| F | 7664 | 5.6% |
| C | 6778 | 5.0% |
| S | 5466 | 4.0% |
| W | 3577 | 2.6% |
| T | 3191 | 2.3% |
| B | 2699 | 2.0% |
| Other values (20) | 16662 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 492 | |
| 0 | 145 | 15.1% |
| 5 | 105 | 10.9% |
| 7 | 67 | 7.0% |
| 6 | 66 | 6.9% |
| 3 | 26 | 2.7% |
| 9 | 24 | 2.5% |
| 8 | 18 | 1.9% |
| 2 | 13 | 1.4% |
| 4 | 6 | 0.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 612 | |
| ' | 148 | 19.4% |
| / | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 36237 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3055 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 19 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 831992 | |
| Common | 41053 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 97796 | 11.8% |
| a | 78125 | 9.4% |
| n | 61952 | 7.4% |
| i | 58663 | 7.1% |
| o | 50830 | 6.1% |
| d | 50373 | 6.1% |
| r | 46106 | 5.5% |
| l | 36292 | 4.4% |
| t | 35119 | 4.2% |
| H | 34544 | 4.2% |
| Other values (69) | 282192 |
Common
| Value | Count | Frequency (%) |
| 36237 | ||
| - | 3055 | 7.4% |
| . | 612 | 1.5% |
| 1 | 492 | 1.2% |
| ' | 148 | 0.4% |
| 0 | 145 | 0.4% |
| 5 | 105 | 0.3% |
| 7 | 67 | 0.2% |
| 6 | 66 | 0.2% |
| 3 | 26 | 0.1% |
| Other values (7) | 100 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 871868 | |
| None | 1156 | 0.1% |
| Latin Ext Additional | 21 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 97796 | 11.2% |
| a | 78125 | 9.0% |
| n | 61952 | 7.1% |
| i | 58663 | 6.7% |
| o | 50830 | 5.8% |
| d | 50373 | 5.8% |
| r | 46106 | 5.3% |
| l | 36292 | 4.2% |
| 36237 | 4.2% | |
| t | 35119 | 4.0% |
| Other values (59) | 320375 |
None
| Value | Count | Frequency (%) |
| é | 551 | |
| ô | 244 | |
| á | 150 | 13.0% |
| í | 64 | 5.5% |
| ó | 39 | 3.4% |
| ñ | 30 | 2.6% |
| ú | 14 | 1.2% |
| ł | 13 | 1.1% |
| Đ | 7 | 0.6% |
| ö | 7 | 0.6% |
| Other values (13) | 37 | 3.2% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 7 | |
| ạ | 7 | |
| ắ | 5 | |
| ồ | 2 | 9.5% |
level3Gid
Text
Missing 
| Distinct | 728 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 178900 |
| Missing (%) | 95.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 12.10446979 |
| Min length | 11 |
Unique
| Unique | 262 ? |
|---|---|
| Unique (%) | 3.4% |
Sample
| 1st row | CAN.2.8.6_1 |
|---|---|
| 2nd row | GBR.3.1.1_1 |
| 3rd row | CAN.7.18.4_1 |
| 4th row | CAN.11.87.11_1 |
| 5th row | FRA.13.2.1_1 |
| Value | Count | Frequency (%) |
| can.7.17.2_1 | 1048 | 13.7% |
| can.2.3.5_1 | 207 | 2.7% |
| can.9.35.1_1 | 173 | 2.3% |
| chn.14.9.8_1 | 163 | 2.1% |
| can.11.88.5_1 | 163 | 2.1% |
| gbr.1.20.1_1 | 153 | 2.0% |
| can.2.9.12_1 | 152 | 2.0% |
| can.2.8.2_1 | 136 | 1.8% |
| can.11.58.5_1 | 135 | 1.8% |
| chn.13.8.1_1 | 130 | 1.7% |
| Other values (718) | 5169 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 22887 | |
| 1 | 17043 | |
| _ | 7629 | 8.3% |
| C | 6398 | 6.9% |
| N | 6301 | 6.8% |
| A | 5812 | 6.3% |
| 2 | 5632 | 6.1% |
| 7 | 3254 | 3.5% |
| 3 | 3181 | 3.4% |
| 8 | 2209 | 2.4% |
| Other values (24) | 11999 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 38942 | |
| Other Punctuation | 22887 | |
| Uppercase Letter | 22887 | |
| Connector Punctuation | 7629 | 8.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 6398 | |
| N | 6301 | |
| A | 5812 | |
| R | 784 | 3.4% |
| H | 780 | 3.4% |
| B | 491 | 2.1% |
| G | 477 | 2.1% |
| U | 360 | 1.6% |
| E | 354 | 1.5% |
| F | 347 | 1.5% |
| Other values (12) | 783 | 3.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 17043 | |
| 2 | 5632 | 14.5% |
| 7 | 3254 | 8.4% |
| 3 | 3181 | 8.2% |
| 8 | 2209 | 5.7% |
| 5 | 2177 | 5.6% |
| 4 | 1946 | 5.0% |
| 9 | 1472 | 3.8% |
| 6 | 1104 | 2.8% |
| 0 | 924 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 22887 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7629 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 69458 | |
| Latin | 22887 | 24.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 6398 | |
| N | 6301 | |
| A | 5812 | |
| R | 784 | 3.4% |
| H | 780 | 3.4% |
| B | 491 | 2.1% |
| G | 477 | 2.1% |
| U | 360 | 1.6% |
| E | 354 | 1.5% |
| F | 347 | 1.5% |
| Other values (12) | 783 | 3.4% |
Common
| Value | Count | Frequency (%) |
| . | 22887 | |
| 1 | 17043 | |
| _ | 7629 | 11.0% |
| 2 | 5632 | 8.1% |
| 7 | 3254 | 4.7% |
| 3 | 3181 | 4.6% |
| 8 | 2209 | 3.2% |
| 5 | 2177 | 3.1% |
| 4 | 1946 | 2.8% |
| 9 | 1472 | 2.1% |
| Other values (2) | 2028 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 92345 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 22887 | |
| 1 | 17043 | |
| _ | 7629 | 8.3% |
| C | 6398 | 6.9% |
| N | 6301 | 6.8% |
| A | 5812 | 6.3% |
| 2 | 5632 | 6.1% |
| 7 | 3254 | 3.5% |
| 3 | 3181 | 3.4% |
| 8 | 2209 | 2.4% |
| Other values (24) | 11999 |
level3Name
Text
Missing 
| Distinct | 722 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 178901 |
| Missing (%) | 95.9% |
| Memory size | 1.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 13.07236497 |
| Min length | 3 |
Unique
| Unique | 255 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | Columbia-Shuswap E |
|---|---|
| 2nd row | Aberdeen |
| 3rd row | Yarmouth Town |
| 4th row | Pont-Rouge |
| 5th row | Grasse |
| Value | Count | Frequency (%) |
| subd | 1358 | 8.8% |
| b | 1147 | 7.5% |
| victoria | 1108 | 7.2% |
| no | 454 | 3.0% |
| division | 344 | 2.2% |
| c | 287 | 1.9% |
| h | 282 | 1.8% |
| capital | 230 | 1.5% |
| part | 224 | 1.5% |
| a | 213 | 1.4% |
| Other values (819) | 9726 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8445 | 8.5% |
| 7745 | 7.8% | |
| i | 6940 | 7.0% |
| o | 6927 | 6.9% |
| n | 5507 | 5.5% |
| e | 5270 | 5.3% |
| r | 4860 | 4.9% |
| t | 4774 | 4.8% |
| u | 4100 | 4.1% |
| l | 3351 | 3.4% |
| Other values (89) | 41797 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 69753 | |
| Uppercase Letter | 15729 | 15.8% |
| Space Separator | 7745 | 7.8% |
| Other Punctuation | 3541 | 3.6% |
| Dash Punctuation | 1261 | 1.3% |
| Decimal Number | 1022 | 1.0% |
| Open Punctuation | 329 | 0.3% |
| Close Punctuation | 326 | 0.3% |
| Final Punctuation | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8445 | |
| i | 6940 | |
| o | 6927 | |
| n | 5507 | 7.9% |
| e | 5270 | 7.6% |
| r | 4860 | 7.0% |
| t | 4774 | 6.8% |
| u | 4100 | 5.9% |
| l | 3351 | 4.8% |
| s | 2470 | 3.5% |
| Other values (39) | 17109 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2629 | |
| B | 1955 | |
| C | 1816 | |
| V | 1358 | 8.6% |
| D | 800 | 5.1% |
| N | 747 | 4.7% |
| L | 669 | 4.3% |
| A | 604 | 3.8% |
| T | 572 | 3.6% |
| H | 546 | 3.5% |
| Other values (20) | 4033 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 330 | |
| 2 | 245 | |
| 0 | 154 | |
| 9 | 76 | 7.4% |
| 7 | 66 | 6.5% |
| 6 | 60 | 5.9% |
| 3 | 31 | 3.0% |
| 4 | 28 | 2.7% |
| 8 | 21 | 2.1% |
| 5 | 11 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1855 | |
| , | 1615 | |
| ' | 65 | 1.8% |
| / | 4 | 0.1% |
| * | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 7745 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1261 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 329 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 326 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 85482 | |
| Common | 14234 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8445 | 9.9% |
| i | 6940 | 8.1% |
| o | 6927 | 8.1% |
| n | 5507 | 6.4% |
| e | 5270 | 6.2% |
| r | 4860 | 5.7% |
| t | 4774 | 5.6% |
| u | 4100 | 4.8% |
| l | 3351 | 3.9% |
| S | 2629 | 3.1% |
| Other values (69) | 32679 |
Common
| Value | Count | Frequency (%) |
| 7745 | ||
| . | 1855 | 13.0% |
| , | 1615 | 11.3% |
| - | 1261 | 8.9% |
| 1 | 330 | 2.3% |
| ( | 329 | 2.3% |
| ) | 326 | 2.3% |
| 2 | 245 | 1.7% |
| 0 | 154 | 1.1% |
| 9 | 76 | 0.5% |
| Other values (10) | 298 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99345 | |
| None | 339 | 0.3% |
| Latin Ext Additional | 22 | < 0.1% |
| Punctuation | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8445 | 8.5% |
| 7745 | 7.8% | |
| i | 6940 | 7.0% |
| o | 6927 | 7.0% |
| n | 5507 | 5.5% |
| e | 5270 | 5.3% |
| r | 4860 | 4.9% |
| t | 4774 | 4.8% |
| u | 4100 | 4.1% |
| l | 3351 | 3.4% |
| Other values (61) | 41426 |
None
| Value | Count | Frequency (%) |
| é | 132 | |
| è | 77 | |
| Î | 28 | 8.3% |
| ä | 21 | 6.2% |
| É | 14 | 4.1% |
| ł | 13 | 3.8% |
| ñ | 9 | 2.7% |
| ú | 7 | 2.1% |
| ư | 6 | 1.8% |
| ơ | 6 | 1.8% |
| Other values (12) | 26 | 7.7% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ế | 13 | |
| ắ | 5 | 22.7% |
| ờ | 2 | 9.1% |
| ọ | 1 | 4.5% |
| ỷ | 1 | 4.5% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 10 |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10881 |
| Missing (%) | 5.8% |
| Memory size | 1.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NE |
|---|---|
| 2nd row | NE |
| 3rd row | LC |
| 4th row | NE |
| 5th row | NE |
| Value | Count | Frequency (%) |
| ne | 150656 | |
| lc | 24131 | 13.7% |
| dd | 199 | 0.1% |
| nt | 177 | 0.1% |
| cr | 177 | 0.1% |
| en | 175 | 0.1% |
| vu | 127 | 0.1% |
| ex | 5 | < 0.1% |
| ew | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 151008 | |
| E | 150837 | |
| C | 24308 | 6.9% |
| L | 24131 | 6.9% |
| D | 398 | 0.1% |
| T | 177 | 0.1% |
| R | 177 | 0.1% |
| V | 127 | < 0.1% |
| U | 127 | < 0.1% |
| X | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 351296 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 151008 | |
| E | 150837 | |
| C | 24308 | 6.9% |
| L | 24131 | 6.9% |
| D | 398 | 0.1% |
| T | 177 | 0.1% |
| R | 177 | 0.1% |
| V | 127 | < 0.1% |
| U | 127 | < 0.1% |
| X | 5 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 351296 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 151008 | |
| E | 150837 | |
| C | 24308 | 6.9% |
| L | 24131 | 6.9% |
| D | 398 | 0.1% |
| T | 177 | 0.1% |
| R | 177 | 0.1% |
| V | 127 | < 0.1% |
| U | 127 | < 0.1% |
| X | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 351296 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 151008 | |
| E | 150837 | |
| C | 24308 | 6.9% |
| L | 24131 | 6.9% |
| D | 398 | 0.1% |
| T | 177 | 0.1% |
| R | 177 | 0.1% |
| V | 127 | < 0.1% |
| U | 127 | < 0.1% |
| X | 5 | < 0.1% |